LGAIApr 3, 2025

Fourier Sliced-Wasserstein Embedding for Multisets and Measures

arXiv:2504.02544v211 citationsh-index: 15ICLR
Originality Highly original
AI Analysis

This work addresses the need for geometrically meaningful representations in machine learning for tasks involving multisets and measures, offering a novel solution with theoretical guarantees.

The paper tackles the problem of embedding multisets and measures into Euclidean space by introducing the Fourier Sliced-Wasserstein (FSW) embedding, which preserves sliced Wasserstein distances and achieves near-optimal output dimensions, leading to state-of-the-art performance in learning Wasserstein distances and improved robustness in PointNet with a 40-fold parameter reduction.

We present the Fourier Sliced-Wasserstein (FSW) embedding - a novel method to embed multisets and measures over $\mathbb{R}^d$ into Euclidean space. Our proposed embedding approximately preserves the sliced Wasserstein distance on distributions, thereby yielding geometrically meaningful representations that better capture the structure of the input. Moreover, it is injective on measures and bi-Lipschitz on multisets - a significant advantage over prevalent methods based on sum- or max-pooling, which are provably not bi-Lipschitz, and, in many cases, not even injective. The required output dimension for these guarantees is near-optimal: roughly $2 N d$, where $N$ is the maximal input multiset size. Furthermore, we prove that it is impossible to embed distributions over $\mathbb{R}^d$ into Euclidean space in a bi-Lipschitz manner. Thus, the metric properties of our embedding are, in a sense, the best possible. Through numerical experiments, we demonstrate that our method yields superior multiset representations that improve performance in practical learning tasks. Specifically, we show that (a) a simple combination of the FSW embedding with an MLP achieves state-of-the-art performance in learning the (non-sliced) Wasserstein distance; and (b) replacing max-pooling with the FSW embedding makes PointNet significantly more robust to parameter reduction, with only minor performance degradation even after a 40-fold reduction.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes