LGMLAug 4, 2018

Global Convergence to the Equilibrium of GANs using Variational Inequalities

arXiv:1808.01531v353 citations
Originality Highly original
AI Analysis

This addresses the instability in GAN training, a critical issue for researchers and practitioners in machine learning, by providing a theoretical convergence guarantee for a specific GAN variant.

The paper tackles the problem of training Generative Adversarial Networks (GANs) by analyzing the Wasserstein Linear-Quadratic GAN variant, showing that gradient descent diverges from equilibrium while a novel orthogonal direction, called Crossing-the-Curl, achieves convergence to the equilibrium.

In optimization, the negative gradient of a function denotes the direction of steepest descent. Furthermore, traveling in any direction orthogonal to the gradient maintains the value of the function. In this work, we show that these orthogonal directions that are ignored by gradient descent can be critical in equilibrium problems. Equilibrium problems have drawn heightened attention in machine learning due to the emergence of the Generative Adversarial Network (GAN). We use the framework of Variational Inequalities to analyze popular training algorithms for a fundamental GAN variant: the Wasserstein Linear-Quadratic GAN. We show that the steepest descent direction causes divergence from the equilibrium, and convergence to the equilibrium is achieved through following a particular orthogonal direction. We call this successful technique Crossing-the-Curl, named for its mathematical derivation as well as its intuition: identify the game's axis of rotation and move "across" space in the direction towards smaller "curling".

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes