CVJan 14, 2019

Dual Generator Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

arXiv:1901.04604v146 citations
Originality Incremental advance
AI Analysis

This addresses the problem of inefficient multi-domain translation for researchers and practitioners in computer vision, though it is incremental as it builds on existing GAN methods.

The paper tackles the scalability and model collapse issues in multi-domain image-to-image translation by proposing a Dual Generator GAN (G^2GAN), which achieves superior model capacity and better generation performance across six datasets.

State-of-the-art methods for image-to-image translation with Generative Adversarial Networks (GANs) can learn a mapping from one domain to another domain using unpaired image data. However, these methods require the training of one specific model for every pair of image domains, which limits the scalability in dealing with more than two image domains. In addition, the training stage of these methods has the common problem of model collapse that degrades the quality of the generated images. To tackle these issues, we propose a Dual Generator Generative Adversarial Network (G$^2$GAN), which is a robust and scalable approach allowing to perform unpaired image-to-image translation for multiple domains using only dual generators within a single model. Moreover, we explore different optimization losses for better training of G$^2$GAN, and thus make unpaired image-to-image translation with higher consistency and better stability. Extensive experiments on six publicly available datasets with different scenarios, i.e., architectural buildings, seasons, landscape and human faces, demonstrate that the proposed G$^2$GAN achieves superior model capacity and better generation performance comparing with existing image-to-image translation GAN models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes