MLLGJun 19, 2024

Generative Modeling by Minimizing the Wasserstein-2 Loss

arXiv:2406.13619v34 citations
Originality Incremental advance
AI Analysis

This provides a new gradient-flow method for generative modeling, potentially improving performance for machine learning practitioners, though it appears incremental as an alternative to existing Wasserstein-based approaches.

The paper tackles unsupervised learning by minimizing the Wasserstein-2 loss via a distribution-dependent ODE, showing exponential convergence to the true data distribution. In experiments, their algorithm outperforms Wasserstein GANs in low- and high-dimensional settings by adjusting persistent training.

This paper approaches the unsupervised learning problem by minimizing the second-order Wasserstein loss (the $W_2$ loss) through a distribution-dependent ordinary differential equation (ODE), whose dynamics involves the Kantorovich potential associated with the true data distribution and a current estimate of it. A main result shows that the time-marginal laws of the ODE form a gradient flow for the $W_2$ loss, which converges exponentially to the true data distribution. An Euler scheme for the ODE is proposed and it is shown to recover the gradient flow for the $W_2$ loss in the limit. An algorithm is designed by following the scheme and applying persistent training, which naturally fits our gradient-flow approach. In both low- and high-dimensional experiments, our algorithm outperforms Wasserstein generative adversarial networks by increasing the level of persistent training appropriately.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes