CVApr 30, 2016

DisturbLabel: Regularizing CNN on the Loss Layer

arXiv:1605.00055v1263 citations
Originality Incremental advance
AI Analysis

This addresses over-fitting in CNNs for image recognition, but it is incremental as it builds on existing regularization techniques.

The paper tackles over-fitting in CNN training by introducing DisturbLabel, a simple algorithm that randomly replaces some labels with incorrect values during training, resulting in competitive recognition results on several image datasets.

During a long period of time we are combating over-fitting in the CNN training process with model regularization, including weight decay, model averaging, data augmentation, etc. In this paper, we present DisturbLabel, an extremely simple algorithm which randomly replaces a part of labels as incorrect values in each iteration. Although it seems weird to intentionally generate incorrect training labels, we show that DisturbLabel prevents the network training from over-fitting by implicitly averaging over exponentially many networks which are trained with different label sets. To the best of our knowledge, DisturbLabel serves as the first work which adds noises on the loss layer. Meanwhile, DisturbLabel cooperates well with Dropout to provide complementary regularization functions. Experiments demonstrate competitive recognition results on several popular image recognition datasets.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes