LGMLFeb 10, 2019

(q,p)-Wasserstein GANs: Comparing Ground Metrics for Wasserstein GANs

arXiv:1902.03642v116 citations
Originality Incremental advance
AI Analysis

This work addresses the optimization of generative models for researchers in machine learning, but it is incremental as it builds upon existing WGAN frameworks by exploring parameter variations.

The authors tackled the problem of improving Wasserstein GANs by generalizing the Wasserstein distance used in the objective function, introducing $(q,p)$-Wasserstein GANs that allow for more general $p$-Wasserstein metrics and $l^q$ ground metrics. They demonstrated that changing these parameters can notably improve performance over the standard case, as shown in experiments on MNIST and CIFAR-10 datasets.

Generative Adversial Networks (GANs) have made a major impact in computer vision and machine learning as generative models. Wasserstein GANs (WGANs) brought Optimal Transport (OT) theory into GANs, by minimizing the $1$-Wasserstein distance between model and data distributions as their objective function. Since then, WGANs have gained considerable interest due to their stability and theoretical framework. We contribute to the WGAN literature by introducing the family of $(q,p)$-Wasserstein GANs, which allow the use of more general $p$-Wasserstein metrics for $p\geq 1$ in the GAN learning procedure. While the method is able to incorporate any cost function as the ground metric, we focus on studying the $l^q$ metrics for $q\geq 1$. This is a notable generalization as in the WGAN literature the OT distances are commonly based on the $l^2$ ground metric. We demonstrate the effect of different $p$-Wasserstein distances in two toy examples. Furthermore, we show that the ground metric does make a difference, by comparing different $(q,p)$ pairs on the MNIST and CIFAR-10 datasets. Our experiments demonstrate that changing the ground metric and $p$ can notably improve on the common $(q,p) = (2,1)$ case.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes