MLIRLGSIJul 27, 2020

Robust Similarity and Distance Learning via Decision Forests

arXiv:2007.13843v2
AI Analysis

This addresses the need for more expressive distance learning methods without sacrificing theoretical properties, interpretability, or computational efficiency, though it appears incremental as it builds on existing forest-based approaches.

The authors tackled the problem of learning appropriate distances between items, which canonical distances like Euclidean often fail to capture, by proposing SMERF, a decision forest algorithm that approximates arbitrary distances and accurately predicts links in networks.

Canonical distances such as Euclidean distance often fail to capture the appropriate relationships between items, subsequently leading to subpar inference and prediction. Many algorithms have been proposed for automated learning of suitable distances, most of which employ linear methods to learn a global metric over the feature space. While such methods offer nice theoretical properties, interpretability, and computationally efficient means for implementing them, they are limited in expressive capacity. Methods which have been designed to improve expressiveness sacrifice one or more of the nice properties of the linear methods. To bridge this gap, we propose a highly expressive novel decision forest algorithm for the task of distance learning, which we call Similarity and Metric Random Forests (SMERF). We show that the tree construction procedure in SMERF is a proper generalization of standard classification and regression trees. Thus, the mathematical driving forces of SMERF are examined via its direct connection to regression forests, for which theory has been developed. Its ability to approximate arbitrary distances and identify important features is empirically demonstrated on simulated data sets. Last, we demonstrate that it accurately predicts links in networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes