CVLGApr 24, 2023

UTSGAN: Unseen Transition Suss GAN for Transition-Aware Image-to-image Translation

arXiv:2304.11955v11 citationsh-index: 72
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in I2I translation for producing high-quality outputs, representing an incremental advancement over previous methods.

The paper tackles the problem of ensuring consistency in Image-to-Image translation for complex and unseen attribute changes by introducing a transition-aware approach with transition consistency, resulting in improved performance demonstrated through experiments on four tasks across five datasets.

In the field of Image-to-Image (I2I) translation, ensuring consistency between input images and their translated results is a key requirement for producing high-quality and desirable outputs. Previous I2I methods have relied on result consistency, which enforces consistency between the translated results and the ground truth output, to achieve this goal. However, result consistency is limited in its ability to handle complex and unseen attribute changes in translation tasks. To address this issue, we introduce a transition-aware approach to I2I translation, where the data translation mapping is explicitly parameterized with a transition variable, allowing for the modelling of unobserved translations triggered by unseen transitions. Furthermore, we propose the use of transition consistency, defined on the transition variable, to enable regularization of consistency on unobserved translations, which is omitted in previous works. Based on these insights, we present Unseen Transition Suss GAN (UTSGAN), a generative framework that constructs a manifold for the transition with a stochastic transition encoder and coherently regularizes and generalizes result consistency and transition consistency on both training and unobserved translations with tailor-designed constraints. Extensive experiments on four different I2I tasks performed on five different datasets demonstrate the efficacy of our proposed UTSGAN in performing consistent translations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes