CVDec 14, 2017

MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels

arXiv:1712.05055v21666 citationsHas Code
Originality Highly original
AI Analysis

This addresses the issue of noisy labels in large-scale datasets for machine learning practitioners, offering a novel solution to improve generalization.

The paper tackles the problem of deep neural networks overfitting on corrupted labels by proposing MentorNet, a neural network that learns a data-driven curriculum to supervise training, achieving state-of-the-art results on the WebVision benchmark with 2.2 million images.

Recent deep networks are capable of memorizing the entire data even when the labels are completely random. To overcome the overfitting on corrupted labels, we propose a novel technique of learning another neural network, called MentorNet, to supervise the training of the base deep networks, namely, StudentNet. During training, MentorNet provides a curriculum (sample weighting scheme) for StudentNet to focus on the sample the label of which is probably correct. Unlike the existing curriculum that is usually predefined by human experts, MentorNet learns a data-driven curriculum dynamically with StudentNet. Experimental results demonstrate that our approach can significantly improve the generalization performance of deep networks trained on corrupted training data. Notably, to the best of our knowledge, we achieve the best-published result on WebVision, a large benchmark containing 2.2 million images of real-world noisy labels. The code are at https://github.com/google/mentornet

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes