MLDSLGNov 24, 2018

MEMOIR: Multi-class Extreme Classification with Inexact Margin

arXiv:1811.09863v2
Originality Incremental advance
AI Analysis

This addresses the computational bottleneck for researchers and practitioners dealing with extreme classification problems, though it is an incremental improvement over existing methods.

The paper tackles the computational intractability of exact margin estimation in multi-class extreme classification by using approximate nearest neighbor search with locality-sensitive hashing, reducing training and prediction time without significant performance loss, as shown on five large-scale datasets.

Multi-class classification with a very large number of classes, or extreme classification, is a challenging problem from both statistical and computational perspectives. Most of the classical approaches to multi-class classification, including one-vs-rest or multi-class support vector machines, require the exact estimation of the classifier's margin, at both the training and the prediction steps making them intractable in extreme classification scenarios. In this paper, we study the impact of computing an approximate margin using nearest neighbor (ANN) search structures combined with locality-sensitive hashing (LSH). This approximation allows to dramatically reduce both the training and the prediction time without a significant loss in performance. We theoretically prove that this approximation does not lead to a significant loss of the risk of the model and provide empirical evidence over five publicly available large scale datasets, showing that the proposed approach is highly competitive with respect to state-of-the-art approaches on time, memory and performance measures.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes