LG MLDec 7, 2020

Towards Generalized Implementation of Wasserstein Distance in GANs

Minkai Xu, Zhiming Zhou, Guansong Lu, Jian Tang, Weinan Zhang, Yong Yu

arXiv:2012.03420v29.07 citationsHas Code

Originality Highly original

AI Analysis

This work addresses the practical difficulties of satisfying the Lipschitz constraint in WGANs, which is a problem for researchers and practitioners aiming to leverage the theoretical soundness of WGANs.

This paper proposes a relaxation of the Lipschitz constraint in Wasserstein GANs (WGANs) by introducing Sobolev duality, a more general dual form of the Wasserstein distance. The authors demonstrate that this relaxed duality maintains favorable gradient properties and empirically show that their proposed Sobolev Wasserstein GAN (SWGAN) training scheme improves upon existing methods.

Wasserstein GANs (WGANs), built upon the Kantorovich-Rubinstein (KR) duality of Wasserstein distance, is one of the most theoretically sound GAN models. However, in practice it does not always outperform other variants of GANs. This is mostly due to the imperfect implementation of the Lipschitz condition required by the KR duality. Extensive work has been done in the community with different implementations of the Lipschitz constraint, which, however, is still hard to satisfy the restriction perfectly in practice. In this paper, we argue that the strong Lipschitz constraint might be unnecessary for optimization. Instead, we take a step back and try to relax the Lipschitz constraint. Theoretically, we first demonstrate a more general dual form of the Wasserstein distance called the Sobolev duality, which relaxes the Lipschitz constraint but still maintains the favorable gradient property of the Wasserstein distance. Moreover, we show that the KR duality is actually a special case of the Sobolev duality. Based on the relaxed duality, we further propose a generalized WGAN training scheme named Sobolev Wasserstein GAN (SWGAN), and empirically demonstrate the improvement of SWGAN over existing methods with extensive experiments.

View on arXiv PDF Code

Similar