CVIVMay 16, 2023

Wavelet-based Unsupervised Label-to-Image Translation

arXiv:2305.09647v17 citations
Originality Highly original
AI Analysis

This addresses the need for photorealistic image generation from semantic layouts without requiring large paired datasets, which is incremental as it builds on existing unpaired translation methods.

The paper tackles the problem of semantic image synthesis without paired data by proposing an unsupervised paradigm using self-supervised segmentation and wavelet-based discrimination, achieving performance that bridges the gap between paired and unpaired models on three challenging datasets.

Semantic Image Synthesis (SIS) is a subclass of image-to-image translation where a semantic layout is used to generate a photorealistic image. State-of-the-art conditional Generative Adversarial Networks (GANs) need a huge amount of paired data to accomplish this task while generic unpaired image-to-image translation frameworks underperform in comparison, because they color-code semantic layouts and learn correspondences in appearance instead of semantic content. Starting from the assumption that a high quality generated image should be segmented back to its semantic layout, we propose a new Unsupervised paradigm for SIS (USIS) that makes use of a self-supervised segmentation loss and whole image wavelet based discrimination. Furthermore, in order to match the high-frequency distribution of real images, a novel generator architecture in the wavelet domain is proposed. We test our methodology on 3 challenging datasets and demonstrate its ability to bridge the performance gap between paired and unpaired models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes