LGCVMLSep 20, 2020

Learning Soft Labels via Meta Learning

arXiv:2009.09496v137 citations
Originality Highly original
AI Analysis

This addresses the issue of suboptimal and noisy labels in supervised learning for tasks like image classification, offering a dynamic regularization method that is incremental but broadly applicable.

The paper tackles the problem of overfitting and poor generalization in supervised learning by proposing a framework that treats labels as learnable parameters, optimizing them dynamically with model parameters, resulting in consistent gains such as a 2.1% improvement for ResNet18 on CIFAR100 and outperforming state-of-the-art on noisy datasets.

One-hot labels do not represent soft decision boundaries among concepts, and hence, models trained on them are prone to overfitting. Using soft labels as targets provide regularization, but different soft labels might be optimal at different stages of optimization. Also, training with fixed labels in the presence of noisy annotations leads to worse generalization. To address these limitations, we propose a framework, where we treat the labels as learnable parameters, and optimize them along with model parameters. The learned labels continuously adapt themselves to the model's state, thereby providing dynamic regularization. When applied to the task of supervised image-classification, our method leads to consistent gains across different datasets and architectures. For instance, dynamically learned labels improve ResNet18 by 2.1% on CIFAR100. When applied to dataset containing noisy labels, the learned labels correct the annotation mistakes, and improves over state-of-the-art by a significant margin. Finally, we show that learned labels capture semantic relationship between classes, and thereby improve teacher models for the downstream task of distillation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes