MLLGSep 26, 2017

On the regularization of Wasserstein GANs

arXiv:1709.08894v2221 citations
AI Analysis

This work addresses a specific training issue in generative adversarial networks for researchers, but it is incremental as it builds on existing regularization methods.

The paper tackles the problem of enforcing the Lipschitz constraint in Wasserstein GANs, which is crucial for stable training, by proposing a weaker regularization term that improves performance over weight clipping, as demonstrated on toy datasets.

Since their invention, generative adversarial networks (GANs) have become a popular approach for learning to model a distribution of real (unlabeled) data. Convergence problems during training are overcome by Wasserstein GANs which minimize the distance between the model and the empirical distribution in terms of a different metric, but thereby introduce a Lipschitz constraint into the optimization problem. A simple way to enforce the Lipschitz constraint on the class of functions, which can be modeled by the neural network, is weight clipping. It was proposed that training can be improved by instead augmenting the loss by a regularization term that penalizes the deviation of the gradient of the critic (as a function of the network's input) from one. We present theoretical arguments why using a weaker regularization term enforcing the Lipschitz constraint is preferable. These arguments are supported by experimental results on toy data sets.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes