MLLGJun 6, 2023

Binary Classification with Instance and Label Dependent Label Noise

arXiv:2306.03402v14 citationsh-index: 12
Originality Incremental advance
AI Analysis

This addresses a challenging issue in machine learning for scenarios with complex label noise, but it is incremental as it builds on existing theoretical frameworks.

The paper tackles the problem of binary classification with instance- and label-dependent label noise, showing that empirical risk minimization achieves the optimal excess risk bound proportional to the noise level, but learning solely with noisy samples is impossible without clean data or strong assumptions.

Learning with label dependent label noise has been extensively explored in both theory and practice; however, dealing with instance (i.e., feature) and label dependent label noise continues to be a challenging task. The difficulty arises from the fact that the noise rate varies for each instance, making it challenging to estimate accurately. The question of whether it is possible to learn a reliable model using only noisy samples remains unresolved. We answer this question with a theoretical analysis that provides matching upper and lower bounds. Surprisingly, our results show that, without any additional assumptions, empirical risk minimization achieves the optimal excess risk bound. Specifically, we derive a novel excess risk bound proportional to the noise level, which holds in very general settings, by comparing the empirical risk minimizers obtained from clean samples and noisy samples. Second, we show that the minimax lower bound for the 0-1 loss is a constant proportional to the average noise rate. Our findings suggest that learning solely with noisy samples is impossible without access to clean samples or strong assumptions on the distribution of the data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes