CVAILGNEMay 18, 2022

Large Neural Networks Learning from Scratch with Very Few Data and without Explicit Regularization

arXiv:2205.08836v24 citationsh-index: 36
Originality Incremental advance
AI Analysis

This challenges traditional machine learning wisdom by showing that over-parameterized networks can generalize effectively with minimal data, potentially simplifying practical applications of CNNs in data-scarce scenarios.

The study demonstrates that very large convolutional neural networks can achieve high accuracy in fine-grained image classification using only a handful of training samples per class, without pretraining or explicit regularization, with results like 95% accuracy on a binary task with 20 samples per class.

Recent findings have shown that highly over-parameterized Neural Networks generalize without pretraining or explicit regularization. It is achieved with zero training error, i.e., complete over-fitting by memorizing the training data. This is surprising, since it is completely against traditional machine learning wisdom. In our empirical study we fortify these findings in the domain of fine-grained image classification. We show that very large Convolutional Neural Networks with millions of weights do learn with only a handful of training samples and without image augmentation, explicit regularization or pretraining. We train the architectures ResNet018, ResNet101 and VGG19 on subsets of the difficult benchmark datasets Caltech101, CUB_200_2011, FGVCAircraft, Flowers102 and StanfordCars with 100 classes and more, perform a comprehensive comparative study and draw implications for the practical application of CNNs. Finally, we show that a randomly initialized VGG19 with 140 million weights learns to distinguish airplanes and motorbikes with up to 95% accuracy using only 20 training samples per class.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes