LGJul 18, 2022

Consistent Polyhedral Surrogates for Top-$k$ Classification and Variants

arXiv:2207.08873v115 citationsh-index: 20
Originality Incremental advance
AI Analysis

This addresses a theoretical gap in machine learning for extreme classification settings like information retrieval and image classification, though it appears incremental as it builds on prior frameworks.

The paper tackles the inconsistency of convex hinge-like surrogates for top-k classification by analyzing existing proposals and deriving constraints for their consistency, and introduces the first consistent polyhedral surrogate for this problem.

Top-$k$ classification is a generalization of multiclass classification used widely in information retrieval, image classification, and other extreme classification settings. Several hinge-like (piecewise-linear) surrogates have been proposed for the problem, yet all are either non-convex or inconsistent. For the proposed hinge-like surrogates that are convex (i.e., polyhedral), we apply the recent embedding framework of Finocchiaro et al. (2019; 2022) to determine the prediction problem for which the surrogate is consistent. These problems can all be interpreted as variants of top-$k$ classification, which may be better aligned with some applications. We leverage this analysis to derive constraints on the conditional label distributions under which these proposed surrogates become consistent for top-$k$. It has been further suggested that every convex hinge-like surrogate must be inconsistent for top-$k$. Yet, we use the same embedding framework to give the first consistent polyhedral surrogate for this problem.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes