DS LGApr 2

Robust Learning with Optimal Error

arXiv:2604.0255563.8

AI Analysis

For learning theory researchers, this resolves open questions on the power of randomization in adversarial noise, providing tight optimal error bounds and closing gaps noted over decades.

This work constructs algorithms with optimal error for learning under adversarial noise models (malicious, nasty, agnostic), showing that randomized hypotheses achieve strictly better error rates than deterministic ones, e.g., optimal error of 0.5η/(1-η) for malicious noise (improving by factor 1/2) and η for agnostic noise (improving from 2η).

We construct algorithms with optimal error for learning with adversarial noise. The overarching theme of this work is that the use of \textsl{randomized} hypotheses can substantially improve upon the best error rates achievable with deterministic hypotheses. - For $Î·$-rate malicious noise, we show the optimal error is $\frac{1}{2} \cdot Î·/(1-Î·)$, improving on the optimal error of deterministic hypotheses by a factor of $1/2$. This answers an open question of Cesa-Bianchi et al. (JACM 1999) who showed randomness can improve error by a factor of $6/7$. - For $Î·$-rate nasty noise, we show the optimal error is $\frac{3}{2} \cdot Î·$ for distribution-independent learners and $Î·$ for fixed-distribution learners, both improving upon the optimal $2 Î·$ error of deterministic hypotheses. This closes a gap first noted by Bshouty et al. (Theoretical Computer Science 2002) when they introduced nasty noise and reiterated in the recent works of Klivans et al. (NeurIPS 2025) and Blanc et al. (SODA 2026). - For $Î·$-rate agnostic noise and the closely related nasty classification noise model, we show the optimal error is $Î·$, improving upon the optimal $2Î·$ error of deterministic hypotheses. All of our learners have sample complexity linear in the VC-dimension of the concept class and polynomial in the inverse excess error. All except for the fixed-distribution nasty noise learner are time efficient given access to an oracle for empirical risk minimization.

View on arXiv PDF

Similar