ML LGJan 27, 2023

LegendreTron: Uprising Proper Multiclass Loss Learning

Kevin Lam, Christian Walder, Spiridon Penev, Richard Nock

arXiv:2301.11695v34.31 citationsh-index: 36

Originality Incremental advance

AI Analysis

This addresses the challenge of avoiding ad hoc loss choices in multiclass supervised learning, though it is an incremental extension from binary to multiclass settings.

The paper tackles the problem of learning proper loss functions for multiclass classification, extending from binary to multiclass by using monotonic gradients of convex functions, and shows that their method consistently outperforms the baseline on datasets with over 10 classes with statistical significance.

Loss functions serve as the foundation of supervised learning and are often chosen prior to model development. To avoid potentially ad hoc choices of losses, statistical decision theory describes a desirable property for losses known as \emph{properness}, which asserts that Bayes' rule is optimal. Recent works have sought to \emph{learn losses} and models jointly. Existing methods do this by fitting an inverse canonical link function which monotonically maps $\mathbb{R}$ to $[0,1]$ to estimate probabilities for binary problems. In this paper, we extend monotonicity to maps between $\mathbb{R}^{C-1}$ and the projected probability simplex $\tildeΔ^{C-1}$ by using monotonicity of gradients of convex functions. We present {\sc LegendreTron} as a novel and practical method that jointly learns \emph{proper canonical losses} and probabilities for multiclass problems. Tested on a benchmark of domains with up to 1,000 classes, our experimental results show that our method consistently outperforms the natural multiclass baseline under a $t$-test at 99% significance on all datasets with greater than 10 classes.

View on arXiv PDF

Similar