MLLGJan 3, 2012

Random Forests for Metric Learning with Implicit Pairwise Position Dependence

arXiv:1201.0610v173 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of efficient and accurate metric learning for heterogeneous data, offering a novel approach that balances performance and speed, though it is incremental in combining existing techniques.

The paper tackles the problem of metric learning for heterogeneous data by proposing a single metric that implicitly adapts its distance function throughout the feature space, using a random forest-based classifier. The method outperforms state-of-the-art global and multi-metric methods in accuracy (consistently ranked first) and is up to 16x faster than multi-metric methods.

Metric learning makes it plausible to learn distances for complex distributions of data from labeled data. However, to date, most metric learning methods are based on a single Mahalanobis metric, which cannot handle heterogeneous data well. Those that learn multiple metrics throughout the space have demonstrated superior accuracy, but at the cost of computational efficiency. Here, we take a new angle to the metric learning problem and learn a single metric that is able to implicitly adapt its distance function throughout the feature space. This metric adaptation is accomplished by using a random forest-based classifier to underpin the distance function and incorporate both absolute pairwise position and standard relative position into the representation. We have implemented and tested our method against state of the art global and multi-metric methods on a variety of data sets. Overall, the proposed method outperforms both types of methods in terms of accuracy (consistently ranked first) and is an order of magnitude faster than state of the art multi-metric methods (16x faster in the worst case).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes