LGMLMar 5, 2020

Does label smoothing mitigate label noise?

arXiv:2003.02819v1411 citations
Originality Incremental advance
AI Analysis

This addresses the problem of noisy labels in machine learning for practitioners, offering an incremental analysis of an existing technique.

The paper investigates whether label smoothing helps mitigate label noise in deep learning, finding that it is competitive with specialized loss-correction techniques and beneficial when distilling models from noisy data, contrasting with noise-free scenarios.

Label smoothing is commonly used in training deep learning models, wherein one-hot training labels are mixed with uniform label vectors. Empirically, smoothing has been shown to improve both predictive performance and model calibration. In this paper, we study whether label smoothing is also effective as a means of coping with label noise. While label smoothing apparently amplifies this problem --- being equivalent to injecting symmetric noise to the labels --- we show how it relates to a general family of loss-correction techniques from the label noise literature. Building on this connection, we show that label smoothing is competitive with loss-correction under label noise. Further, we show that when distilling models from noisy data, label smoothing of the teacher is beneficial; this is in contrast to recent findings for noise-free problems, and sheds further light on settings where label smoothing is beneficial.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes