MLLGMay 19, 2017

Relaxed Wasserstein with Applications to GANs

arXiv:1705.07164v837 citations
Originality Incremental advance
AI Analysis

This addresses the problem of improving generative model flexibility and training efficiency for machine learning practitioners, representing an incremental advancement.

The paper tackled the limitations of Wasserstein GANs by proposing Relaxed Wasserstein distances using Bregman cost functions, resulting in RWGANs that outperform WGANs with gradient penalty on real image datasets.

Wasserstein Generative Adversarial Networks (WGANs) provide a versatile class of models, which have attracted great attention in various applications. However, this framework has two main drawbacks: (i) Wasserstein-1 (or Earth-Mover) distance is restrictive such that WGANs cannot always fit data geometry well; (ii) It is difficult to achieve fast training of WGANs. In this paper, we propose a new class of \textit{Relaxed Wasserstein} (RW) distances by generalizing Wasserstein-1 distance with Bregman cost functions. We show that RW distances achieve nice statistical properties while not sacrificing the computational tractability. Combined with the GANs framework, we develop Relaxed WGANs (RWGANs) which are not only statistically flexible but can be approximated efficiently using heuristic approaches. Experiments on real images demonstrate that the RWGAN with Kullback-Leibler (KL) cost function outperforms other competing approaches, e.g., WGANs, even with gradient penalty.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes