CVIVJul 6, 2021

Independent Encoder for Deep Hierarchical Unsupervised Image-to-Image Translation

arXiv:2107.02494v1Has Code
AI Analysis

This work addresses image-to-image translation for computer vision applications, but it appears incremental as it builds on existing GAN-based methods with architectural modifications.

The paper tackles the challenge of generating realistic images while preserving source information in unsupervised image-to-image translation by proposing IEGAN, which uses an independent encoder to improve image quality and information retention, showing superiority over previous models in experiments.

The main challenges of image-to-image (I2I) translation are to make the translated image realistic and retain as much information from the source domain as possible. To address this issue, we propose a novel architecture, termed as IEGAN, which removes the encoder of each network and introduces an encoder that is independent of other networks. Compared with previous models, it embodies three advantages of our model: Firstly, it is more directly and comprehensively to grasp image information since the encoder no longer receives loss from generator and discriminator. Secondly, the independent encoder allows each network to focus more on its own goal which makes the translated image more realistic. Thirdly, the reduction in the number of encoders performs more unified image representation. However, when the independent encoder applies two down-sampling blocks, it's hard to extract semantic information. To tackle this problem, we propose deep and shallow information space containing characteristic and semantic information, which can guide the model to translate high-quality images under the task with significant shape or texture change. We compare IEGAN with other previous models, and conduct researches on semantic information consistency and component ablation at the same time. These experiments show the superiority and effectiveness of our architecture. Our code is published on: https://github.com/Elvinky/IEGAN.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes