CVMay 8, 2018

Learning image-to-image translation using paired and unpaired training samples

arXiv:1805.03189v145 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of limited paired data in image-to-image translation, offering a hybrid approach that improves performance for applications like computer vision.

The paper tackles the problem of image-to-image translation by proposing a model that uses both paired and unpaired training data simultaneously, outperforming strong baselines in both qualitative and quantitative results.

Image-to-image translation is a general name for a task where an image from one domain is converted to a corresponding image in another domain, given sufficient training data. Traditionally different approaches have been proposed depending on whether aligned image pairs or two sets of (unaligned) examples from both domains are available for training. While paired training samples might be difficult to obtain, the unpaired setup leads to a highly under-constrained problem and inferior results. In this paper, we propose a new general purpose image-to-image translation model that is able to utilize both paired and unpaired training data simultaneously. We compare our method with two strong baselines and obtain both qualitatively and quantitatively improved results. Our model outperforms the baselines also in the case of purely paired and unpaired training data. To our knowledge, this is the first work to consider such hybrid setup in image-to-image translation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes