LG MLJul 11, 2016

Learning a metric for class-conditional KNN

arXiv:1607.03050v11.0Has Code

Originality Incremental advance

AI Analysis

This addresses the need for effective metric learning in domains beyond images, though it is incremental as it builds on NBNN and metric learning techniques.

The paper tackles the problem of naive Bayes nearest neighbor (NBNN) failing when data representations lack perceptual similarity, by proposing a class-conditional metric learning (CCML) method that optimizes a soft form of the NBNN selection rule. The result is that CCML clearly outperforms existing learned distance metrics across various image and non-image datasets in classification and retrieval tasks.

Naive Bayes Nearest Neighbour (NBNN) is a simple and effective framework which addresses many of the pitfalls of K-Nearest Neighbour (KNN) classification. It has yielded competitive results on several computer vision benchmarks. Its central tenet is that during NN search, a query is not compared to every example in a database, ignoring class information. Instead, NN searches are performed within each class, generating a score per class. A key problem with NN techniques, including NBNN, is that they fail when the data representation does not capture perceptual (e.g.~class-based) similarity. NBNN circumvents this by using independent engineered descriptors (e.g.~SIFT). To extend its applicability outside of image-based domains, we propose to learn a metric which captures perceptual similarity. Similar to how Neighbourhood Components Analysis optimizes a differentiable form of KNN classification, we propose "Class Conditional" metric learning (CCML), which optimizes a soft form of the NBNN selection rule. Typical metric learning algorithms learn either a global or local metric. However, our proposed method can be adjusted to a particular level of locality by tuning a single parameter. An empirical evaluation on classification and retrieval tasks demonstrates that our proposed method clearly outperforms existing learned distance metrics across a variety of image and non-image datasets.

View on arXiv PDF Code

Similar