LG MLOct 15, 2019

SGD Learns One-Layer Networks in WGANs

Qi Lei, Jason D. Lee, Alexandros G. Dimakis, Constantinos Daskalakis

arXiv:1910.07030v218.642 citations

Originality Incremental advance

AI Analysis

This provides theoretical guarantees for a widely used framework in generative modeling, addressing a key challenge in WGAN optimization.

The paper tackles the problem of training Wasserstein GANs (WGANs) with a one-layer generator, proving that stochastic gradient descent-ascent converges to a global solution with polynomial time and sample complexity.

Generative adversarial networks (GANs) are a widely used framework for learning generative models. Wasserstein GANs (WGANs), one of the most successful variants of GANs, require solving a minmax optimization problem to global optimality, but are in practice successfully trained using stochastic gradient descent-ascent. In this paper, we show that, when the generator is a one-layer network, stochastic gradient descent-ascent converges to a global solution with polynomial time and sample complexity.

View on arXiv PDF

Similar