CVSep 28, 2019

Semantic Example Guided Image-to-Image Translation

arXiv:1909.13028v27 citations
AI Analysis

This work addresses the need for more precise semantic control in image-to-image translation, which is incremental as it builds on prior multi-modal methods but focuses on preserving reference semantics.

The paper tackles the problem of controlling image-to-image translation outputs based on semantic references, rather than noise or latent vectors, by introducing a self-supervised framework that uses semantic matching and non-local blocks to improve output quality and diversity.

Many image-to-image (I2I) translation problems are in nature of high diversity that a single input may have various counterparts. Prior works proposed the multi-modal network that can build a many-to-many mapping between two visual domains. However, most of them are guided by sampled noises. Some others encode the reference images into a latent vector, by which the semantic information of the reference image will be washed away. In this work, we aim to provide a solution to control the output based on references semantically. Given a reference image and an input in another domain, a semantic matching is first performed between the two visual contents and generates the auxiliary image, which is explicitly encouraged to preserve semantic characteristics of the reference. A deep network then is used for I2I translation and the final outputs are expected to be semantically similar to both the input and the reference; however, no such paired data can satisfy that dual-similarity in a supervised fashion, so we build up a self-supervised framework to serve the training purpose. We improve the quality and diversity of the outputs by employing non-local blocks and a multi-task architecture. We assess the proposed method through extensive qualitative and quantitative evaluations and also presented comparisons with several state-of-art models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes