CVMay 24, 2018

Image-to-image translation for cross-domain disentanglement

arXiv:1805.09730v3258 citations
Originality Incremental advance
AI Analysis

This addresses the need for finer control and improved performance in image translation tasks for researchers and practitioners in computer vision, though it appears incremental as it builds on existing GAN and autoencoder frameworks.

The paper tackled the problem of disentangling internal representations in image-to-image translation by introducing cross-domain disentanglement to separate shared and domain-specific information, achieving better results than state-of-the-art methods in translation and cross-domain retrieval on challenging datasets.

Deep image translation methods have recently shown excellent results, outputting high-quality images covering multiple modes of the data distribution. There has also been increased interest in disentangling the internal representations learned by deep methods to further improve their performance and achieve a finer control. In this paper, we bridge these two objectives and introduce the concept of cross-domain disentanglement. We aim to separate the internal representation into three parts. The shared part contains information for both domains. The exclusive parts, on the other hand, contain only factors of variation that are particular to each domain. We achieve this through bidirectional image translation based on Generative Adversarial Networks and cross-domain autoencoders, a novel network component. Our model offers multiple advantages. We can output diverse samples covering multiple modes of the distributions of both domains, perform domain-specific image transfer and interpolation, and cross-domain retrieval without the need of labeled data, only paired images. We compare our model to the state-of-the-art in multi-modal image translation and achieve better results for translation on challenging datasets as well as for cross-domain retrieval on realistic datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes