CVLGJul 30, 2020

Contrastive Learning for Unpaired Image-to-Image Translation

arXiv:2007.15651v31604 citations
Originality Incremental advance
AI Analysis

This addresses the problem of generating realistic image translations without paired data for computer vision applications, representing an incremental improvement over existing methods.

The paper tackles unpaired image-to-image translation by proposing a contrastive learning method that maximizes mutual information between corresponding patches, enabling one-sided translation while improving quality and reducing training time.

In image-to-image translation, each patch in the output should reflect the content of the corresponding patch in the input, independent of domain. We propose a straightforward method for doing so -- maximizing mutual information between the two, using a framework based on contrastive learning. The method encourages two elements (corresponding patches) to map to a similar point in a learned feature space, relative to other elements (other patches) in the dataset, referred to as negatives. We explore several critical design choices for making contrastive learning effective in the image synthesis setting. Notably, we use a multilayer, patch-based approach, rather than operate on entire images. Furthermore, we draw negatives from within the input image itself, rather than from the rest of the dataset. We demonstrate that our framework enables one-sided translation in the unpaired image-to-image translation setting, while improving quality and reducing training time. In addition, our method can even be extended to the training setting where each "domain" is only a single image.

Code Implementations10 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes