LGCVDec 1, 2020

How to fine-tune deep neural networks in few-shot learning?

arXiv:2012.00204v118 citations
AI Analysis

This research addresses the practical problem of effectively adapting deep learning models to new tasks with limited data for practitioners using few-shot learning.

This paper investigates the optimal strategy for fine-tuning deep neural networks in few-shot learning scenarios. It experimentally compares different fine-tuning approaches for convolutional and batch normalization layers and analyzes model weights to validate the methods.

Deep learning has been widely used in data-intensive applications. However, training a deep neural network often requires a large data set. When there is not enough data available for training, the performance of deep learning models is even worse than that of shallow networks. It has been proved that few-shot learning can generalize to new tasks with few training samples. Fine-tuning of a deep model is simple and effective few-shot learning method. However, how to fine-tune deep learning models (fine-tune convolution layer or BN layer?) still lack deep investigation. Hence, we study how to fine-tune deep models through experimental comparison in this paper. Furthermore, the weight of the models is analyzed to verify the feasibility of the fine-tuning method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes