ML LGJan 17, 2022

Minimax risk classifiers with 0-1 loss

Santiago Mazuelas, Mauricio Romero, Peter Grünwald

arXiv:2201.06487v610.810 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of obtaining reliable error bounds in classification for practitioners, though it builds incrementally on existing minimax and kernel methods.

The paper tackles the problem of supervised classification with 0-1 loss by introducing minimax risk classifiers (MRCs) that minimize worst-case error probability with distributional uncertainty, showing they provide tight performance guarantees and strong universal consistency using characteristic kernels.

Supervised classification techniques use training samples to learn a classification rule with small expected 0-1 loss (error probability). Conventional methods enable tractable learning and provide out-of-sample generalization by using surrogate losses instead of the 0-1 loss and considering specific families of rules (hypothesis classes). This paper presents minimax risk classifiers (MRCs) that minize the worst-case 0-1 loss with respect to uncertainty sets of distributions that can include the underlying distribution, with a tunable confidence. We show that MRCs can provide tight performance guarantees at learning and are strongly universally consistent using feature mappings given by characteristic kernels. The paper also proposes efficient optimization techniques for MRC learning and shows that the methods presented can provide accurate classification together with tight performance guarantees in practice.

View on arXiv PDF

Similar