CVAILGMar 21, 2023

Fix the Noise: Disentangling Source Feature for Controllable Domain Translation

arXiv:2303.11545v17 citationsh-index: 43Has Code
AI Analysis

This addresses the challenge of achieving precise and smooth control in domain translation for image generation, which is an incremental improvement over existing methods.

The paper tackles the problem of controllable domain translation by preserving source features in a disentangled subspace, enabling smooth control over feature preservation with a single model. The method produces more consistent and realistic images than previous works, as shown in extensive experiments.

Recent studies show strong generative performance in domain translation especially by using transfer learning techniques on the unconditional generator. However, the control between different domain features using a single model is still challenging. Existing methods often require additional models, which is computationally demanding and leads to unsatisfactory visual quality. In addition, they have restricted control steps, which prevents a smooth transition. In this paper, we propose a new approach for high-quality domain translation with better controllability. The key idea is to preserve source features within a disentangled subspace of a target feature space. This allows our method to smoothly control the degree to which it preserves source features while generating images from an entirely new domain using only a single model. Our extensive experiments show that the proposed method can produce more consistent and realistic images than previous works and maintain precise controllability over different levels of transformation. The code is available at https://github.com/LeeDongYeun/FixNoise.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes