CVJun 28, 2018

CR-GAN: Learning Complete Representations for Multi-view Generation

arXiv:1806.11191v1158 citations
Originality Incremental advance
AI Analysis

This work solves a challenging problem in vision, graphics, and robotics with incremental improvements to existing GAN frameworks.

The paper tackles the problem of generating multi-view images from a single-view input by addressing incomplete representations in GANs, resulting in significantly improved generalization to unseen datasets and outperforming state-of-the-art methods.

Generating multi-view images from a single-view input is an essential yet challenging problem. It has broad applications in vision, graphics, and robotics. Our study indicates that the widely-used generative adversarial network (GAN) may learn "incomplete" representations due to the single-pathway framework: an encoder-decoder network followed by a discriminator network. We propose CR-GAN to address this problem. In addition to the single reconstruction path, we introduce a generation sideway to maintain the completeness of the learned embedding space. The two learning pathways collaborate and compete in a parameter-sharing manner, yielding considerably improved generalization ability to "unseen" dataset. More importantly, the two-pathway framework makes it possible to combine both labeled and unlabeled data for self-supervised learning, which further enriches the embedding space for realistic generations. The experimental results prove that CR-GAN significantly outperforms state-of-the-art methods, especially when generating from "unseen" inputs in wild conditions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes