MLLGFeb 10, 2021

On the Existence of Optimal Transport Gradient for Learning Generative Models

arXiv:2102.05542v17 citations
Originality Incremental advance
AI Analysis

This work addresses a theoretical and practical problem for researchers and practitioners using WGANs, offering incremental improvements to enhance stability in training generative models.

The paper tackles the issue that the gradient of optimal transport cost used in Wasserstein Generative Adversarial Networks (WGAN) may not be defined, leading to numerical instabilities, and addresses this by providing a valid differentiation theorem for entropic regularized transport and proposing an optimization algorithm, showing numerical advantages.

The use of optimal transport cost for learning generative models has become popular with Wasserstein Generative Adversarial Networks (WGAN). Training of WGAN relies on a theoretical background: the calculation of the gradient of the optimal transport cost with respect to the generative model parameters. We first demonstrate that such gradient may not be defined, which can result in numerical instabilities during gradient-based optimization. We address this issue by stating a valid differentiation theorem in the case of entropic regularized transport and specify conditions under which existence is ensured. By exploiting the discrete nature of empirical data, we formulate the gradient in a semi-discrete setting and propose an algorithm for the optimization of the generative model parameters. Finally, we illustrate numerically the advantage of the proposed framework.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes