LG MLFeb 14, 2020

An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality

Silviu Pitis, Harris Chan, Kiarash Jamali, Jimmy Ba

arXiv:2002.05825v315.633 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses a limitation in modeling distances for machine learning tasks like graphs and reinforcement learning, though it is incremental as it builds on existing metric learning concepts.

The paper tackled the problem that deep metric learning architectures relying on Euclidean distance fail to model asymmetric or non-Euclidean subadditive distances common in graphs and reinforcement learning, and introduced novel architectures guaranteed to satisfy the triangle inequality, which outperformed existing metric approaches for graph distances and showed better inductive bias with limited data in multi-goal reinforcement learning.

Distances are pervasive in machine learning. They serve as similarity measures, loss functions, and learning targets; it is said that a good distance measure solves a task. When defining distances, the triangle inequality has proven to be a useful constraint, both theoretically--to prove convergence and optimality guarantees--and empirically--as an inductive bias. Deep metric learning architectures that respect the triangle inequality rely, almost exclusively, on Euclidean distance in the latent space. Though effective, this fails to model two broad classes of subadditive distances, common in graphs and reinforcement learning: asymmetric metrics, and metrics that cannot be embedded into Euclidean space. To address these problems, we introduce novel architectures that are guaranteed to satisfy the triangle inequality. We prove our architectures universally approximate norm-induced metrics on $\mathbb{R}^n$, and present a similar result for modified Input Convex Neural Networks. We show that our architectures outperform existing metric approaches when modeling graph distances and have a better inductive bias than non-metric approaches when training data is limited in the multi-goal reinforcement learning setting.

View on arXiv PDF Code

Similar