MLLGJan 24, 2025

Optimal Transport Barycenter via Nonconvex-Concave Minimax Optimization

arXiv:2501.14635v26 citationsh-index: 5ICML
Originality Incremental advance
AI Analysis

This work addresses a fundamental bottleneck in optimal transport for researchers and practitioners, offering a scalable solution for applications like image processing, though it is incremental as it builds on prior primal-dual and geometric optimization frameworks.

The paper tackles the computationally challenging problem of computing the exact optimal transport barycenter for high-dimensional discretized probability distributions, introducing the WDHA algorithm that achieves nearly linear time and space complexity, demonstrating superior speed and accuracy over existing methods on high-resolution 2D data.

The optimal transport barycenter (a.k.a. Wasserstein barycenter) is a fundamental notion of averaging that extends from the Euclidean space to the Wasserstein space of probability distributions. Computation of the unregularized barycenter for discretized probability distributions on point clouds is a challenging task when the domain dimension $d > 1$. Most practical algorithms for approximating the barycenter problem are based on entropic regularization. In this paper, we introduce a nearly linear time $O(m \log{m})$ and linear space complexity $O(m)$ primal-dual algorithm, the Wasserstein-Descent $\dot{\mathbb{H}}^1$-Ascent (WDHA) algorithm, for computing the exact barycenter when the input probability density functions are discretized on an $m$-point grid. The key success of the WDHA algorithm hinges on alternating between two different yet closely related Wasserstein and Sobolev optimization geometries for the primal barycenter and dual Kantorovich potential subproblems. Under reasonable assumptions, we establish the convergence rate and iteration complexity of WDHA to its stationary point when the step size is appropriately chosen. Superior computational efficacy, scalability, and accuracy over the existing Sinkhorn-type algorithms are demonstrated on high-resolution (e.g., $1024 \times 1024$ images) 2D synthetic and real data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes