MLJun 22, 2016

Dealing with a large number of classes -- Likelihood, Discrimination or Ranking?

arXiv:1606.06959v25 citations
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of scalability in classification for applications with many classes, but it is incremental as it builds on existing approximations.

The paper tackles the problem of training probabilistic classifiers with a large number of classes by proposing a simple approach that approximates the likelihood, showing it works well on toy problems and is competitive with recent non-likelihood based approximations.

We consider training probabilistic classifiers in the case of a large number of classes. The number of classes is assumed too large to perform exact normalisation over all classes. To account for this we consider a simple approach that directly approximates the likelihood. We show that this simple approach works well on toy problems and is competitive with recently introduced alternative non-likelihood based approximations. Furthermore, we relate this approach to a simple ranking objective. This leads us to suggest a specific setting for the optimal threshold in the ranking objective.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes