CVNov 11, 2020

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

arXiv:2011.05867v118 citations
AI Analysis

This addresses a bottleneck in image-to-image translation for domains requiring large shape changes, with incremental improvements in performance and scalability to over 100 classes.

The paper tackles the problem of inferior performance in image-to-image translation when large shape changes are required, by proposing DeepI2I, a deep hierarchical method that leverages hierarchical features and transfer learning from pre-trained GANs, resulting in at least a 35% decrease in mFID compared to state-of-the-art on three datasets.

Image-to-image translation has recently achieved remarkable results. But despite current success, it suffers from inferior performance when translations between classes require large shape changes. We attribute this to the high-resolution bottlenecks which are used by current state-of-the-art image-to-image methods. Therefore, in this work, we propose a novel deep hierarchical Image-to-Image Translation method, called DeepI2I. We learn a model by leveraging hierarchical features: (a) structural information contained in the shallow layers and (b) semantic information extracted from the deep layers. To enable the training of deep I2I models on small datasets, we propose a novel transfer learning method, that transfers knowledge from pre-trained GANs. Specifically, we leverage the discriminator of a pre-trained GANs (i.e. BigGAN or StyleGAN) to initialize both the encoder and the discriminator and the pre-trained generator to initialize the generator of our model. Applying knowledge transfer leads to an alignment problem between the encoder and generator. We introduce an adaptor network to address this. On many-class image-to-image translation on three datasets (Animal faces, Birds, and Foods) we decrease mFID by at least 35% when compared to the state-of-the-art. Furthermore, we qualitatively and quantitatively demonstrate that transfer learning significantly improves the performance of I2I systems, especially for small datasets. Finally, we are the first to perform I2I translations for domains with over 100 classes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes