CVJan 14, 2019

Dual Generator Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

Hao Tang, Dan Xu, Wei Wang, Yan Yan, Nicu Sebe

arXiv:1901.04604v112.746 citations

Originality Incremental advance

AI Analysis

This addresses the problem of inefficient multi-domain translation for researchers and practitioners in computer vision, though it is incremental as it builds on existing GAN methods.

The paper tackles the scalability and model collapse issues in multi-domain image-to-image translation by proposing a Dual Generator GAN (G^2GAN), which achieves superior model capacity and better generation performance across six datasets.

State-of-the-art methods for image-to-image translation with Generative Adversarial Networks (GANs) can learn a mapping from one domain to another domain using unpaired image data. However, these methods require the training of one specific model for every pair of image domains, which limits the scalability in dealing with more than two image domains. In addition, the training stage of these methods has the common problem of model collapse that degrades the quality of the generated images. To tackle these issues, we propose a Dual Generator Generative Adversarial Network (G$^2$GAN), which is a robust and scalable approach allowing to perform unpaired image-to-image translation for multiple domains using only dual generators within a single model. Moreover, we explore different optimization losses for better training of G$^2$GAN, and thus make unpaired image-to-image translation with higher consistency and better stability. Extensive experiments on six publicly available datasets with different scenarios, i.e., architectural buildings, seasons, landscape and human faces, demonstrate that the proposed G$^2$GAN achieves superior model capacity and better generation performance comparing with existing image-to-image translation GAN models.

View on arXiv PDF

Similar