ML · LG · Dec 1, 2021

Controlling Wasserstein Distances by Kernel Norms with Application to Compressive Statistical Learning

arXiv:2112.00423v3 · 17 citations
Originality: Incremental advance
AI Analysis

This work addresses resource-efficient learning for large-scale data by summarizing training data into compact sketches, though it appears incremental as it builds on existing CSL theory.

The paper establishes conditions under which Wasserstein distances can be controlled by Maximum Mean Discrepancy (MMD) norms, motivated by compressive statistical learning (CSL) for resource-efficient large-scale learning. It introduces the Hölder Lower Restricted Isometric Property and provides guarantees for CSL by studying Wasserstein regularity of learning tasks.

Comparing probability distributions is at the crux of many machine learning algorithms. Maximum Mean Discrepancies (MMD) and Wasserstein distances are two classes of distances between probability distributions that have attracted abundant attention in past years. This paper establishes conditions under which the Wasserstein distance can be controlled by MMD norms. Our work is motivated by the compressive statistical learning (CSL) theory, a general framework for resource-efficient large-scale learning in which the training data is summarized in a single vector (called a sketch) that captures the information relevant to the considered learning task. Inspired by existing results in CSL, we introduce the Hölder Lower Restricted Isometric Property and show that this property comes with interesting guarantees for compressive statistical learning. Based on the relations between the MMD and the Wasserstein distances, we provide guarantees for compressive statistical learning by introducing and studying the concept of Wasserstein regularity of the learning task, that is, when some task-specific metric between probability distributions can be bounded by a Wasserstein distance.
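To make the objects in the abstract concrete, here is a minimal sketch of the three quantities it relates: an empirical MMD, a Wasserstein distance, and a sketch of the data. It is not the paper's method. It assumes 1-D samples, a Gaussian kernel, and a sketch built as the empirical mean of random Fourier features, and the function names (`gaussian_mmd`, `wasserstein1_1d`, `sketch`) are hypothetical.

```python
import numpy as np


def gaussian_mmd(x, y, bandwidth=1.0):
    """Biased (V-statistic) estimate of the MMD between 1-D samples x and y."""
    def gram(a, b):
        # Gaussian kernel matrix k(a_i, b_j) = exp(-(a_i - b_j)^2 / (2 * bandwidth^2))
        d2 = (a[:, None] - b[None, :]) ** 2
        return np.exp(-d2 / (2.0 * bandwidth ** 2))

    mmd2 = gram(x, x).mean() + gram(y, y).mean() - 2.0 * gram(x, y).mean()
    return float(np.sqrt(max(mmd2, 0.0)))  # clip tiny negative rounding errors


def wasserstein1_1d(x, y):
    """W1 between two equal-size 1-D empirical distributions.

    In one dimension the optimal coupling matches sorted samples.
    """
    if len(x) != len(y):
        raise ValueError("this sketch assumes samples of equal size")
    return float(np.abs(np.sort(x) - np.sort(y)).mean())


def sketch(x, freqs):
    """Summarize a dataset as one vector: the mean of random Fourier features."""
    return np.exp(1j * np.outer(x, freqs)).mean(axis=0)


rng = np.random.default_rng(0)
x = rng.normal(0.0, 1.0, size=500)
y = rng.normal(0.5, 1.0, size=500)

# Assumption: frequencies drawn from N(0, 1 / bandwidth^2) so that the sketch
# distance approximates the Gaussian-kernel MMD with bandwidth 1.
freqs = rng.normal(0.0, 1.0, size=2000)

mmd = gaussian_mmd(x, y)
w1 = wasserstein1_1d(x, y)
sketch_dist = float(
    np.linalg.norm(sketch(x, freqs) - sketch(y, freqs)) / np.sqrt(len(freqs))
)

print(f"MMD: {mmd:.3f}  sketch distance: {sketch_dist:.3f}  W1: {w1:.3f}")
```

The normalized distance between the two sketches is a Monte Carlo approximation of the kernel MMD, which is why results that bound a Wasserstein distance by an MMD norm are relevant to learning from sketches.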

