CVMay 27, 2021

Using Early-Learning Regularization to Classify Real-World Noisy Data

arXiv:2105.13244v21 citations
Originality Synthesis-oriented
AI Analysis

This work addresses label noise in computer vision for real-world applications, but it is incremental as it replicates and extends prior methods.

The study tackled the problem of classifying noisy real-world data by replicating Early-Learning Regularization and testing it on a real-world dataset, achieving consistent results and a 14.6 percentage point improvement with Sharpness-Aware Minimization.

The memorization problem is well-known in the field of computer vision. Liu et al. propose a technique called Early-Learning Regularization, which improves accuracy on the CIFAR datasets when label noise is present. This project replicates their experiments and investigates the performance on a real-world dataset with intrinsic noise. Results show that their experimental results are consistent. We also explore Sharpness-Aware Minimization in addition to SGD and observed a further 14.6 percentage points improvement. Future work includes using all 6 million images and manually clean a fraction of the images to fine-tune a transfer learning model. Last but not the least, having access to clean data for testing would also improve the measurement of accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes