LGMLJun 11, 2018

A Fast and Easy Regression Technique for k-NN Classification Without Using Negative Pairs

arXiv:1806.03945v2
AI Analysis

This provides a fast and efficient alternative to existing metric learning methods for k-NN classification, particularly beneficial for large-scale applications, though it is incremental as it builds on prior metric learning approaches.

The paper tackles the problem of learning an effective dissimilarity function for k-NN classification by proposing a method that transforms only labeled objects while keeping query objects in original coordinates, achieving comparable or better accuracy than state-of-the-art metric learning methods on large document and image datasets and training over two orders of magnitude faster.

This paper proposes an inexpensive way to learn an effective dissimilarity function to be used for $k$-nearest neighbor ($k$-NN) classification. Unlike Mahalanobis metric learning methods that map both query (unlabeled) objects and labeled objects to new coordinates by a single transformation, our method learns a transformation of labeled objects to new points in the feature space whereas query objects are kept in their original coordinates. This method has several advantages over existing distance metric learning methods: (i) In experiments with large document and image datasets, it achieves $k$-NN classification accuracy better than or at least comparable to the state-of-the-art metric learning methods. (ii) The transformation can be learned efficiently by solving a standard ridge regression problem. For document and image datasets, training is often more than two orders of magnitude faster than the fastest metric learning methods tested. This speed-up is also due to the fact that the proposed method eliminates the optimization over "negative" object pairs, i.e., objects whose class labels are different. (iii) The formulation has a theoretical justification in terms of reducing hubness in data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes