MLLGFeb 1, 2019

Tree-Sliced Variants of Wasserstein Distances

arXiv:1902.00342v398 citations
AI Analysis

This work provides an incremental improvement for researchers in machine learning and statistics by offering a more efficient method for comparing probability distributions.

The authors tackled the computational and statistical drawbacks of optimal transport by introducing tree-sliced Wasserstein distances, which use random tree metrics to enable fast closed-form computations and achieve competitive performance on benchmark tasks.

Optimal transport (\OT) theory defines a powerful set of tools to compare probability distributions. \OT~suffers however from a few drawbacks, computational and statistical, which have encouraged the proposal of several regularized variants of OT in the recent literature, one of the most notable being the \textit{sliced} formulation, which exploits the closed-form formula between univariate distributions by projecting high-dimensional measures onto random lines. We consider in this work a more general family of ground metrics, namely \textit{tree metrics}, which also yield fast closed-form computations and negative definite, and of which the sliced-Wasserstein distance is a particular case (the tree is a chain). We propose the tree-sliced Wasserstein distance, computed by averaging the Wasserstein distance between these measures using random tree metrics, built adaptively in either low or high-dimensional spaces. Exploiting the negative definiteness of that distance, we also propose a positive definite kernel, and test it against other baselines on a few benchmark tasks.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes