MLLGJan 17, 2022

Minimax risk classifiers with 0-1 loss

arXiv:2201.06487v69 citations
AI Analysis

This addresses the challenge of obtaining reliable error bounds in classification for practitioners, though it builds incrementally on existing minimax and kernel methods.

The paper tackles the problem of supervised classification with 0-1 loss by introducing minimax risk classifiers (MRCs) that minimize worst-case error probability with distributional uncertainty, showing they provide tight performance guarantees and strong universal consistency using characteristic kernels.

Supervised classification techniques use training samples to learn a classification rule with small expected 0-1 loss (error probability). Conventional methods enable tractable learning and provide out-of-sample generalization by using surrogate losses instead of the 0-1 loss and considering specific families of rules (hypothesis classes). This paper presents minimax risk classifiers (MRCs) that minize the worst-case 0-1 loss with respect to uncertainty sets of distributions that can include the underlying distribution, with a tunable confidence. We show that MRCs can provide tight performance guarantees at learning and are strongly universally consistent using feature mappings given by characteristic kernels. The paper also proposes efficient optimization techniques for MRC learning and shows that the methods presented can provide accurate classification together with tight performance guarantees in practice.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes