CVJan 11, 2021

Multi-Domain Image-to-Image Translation with Adaptive Inference Graph

arXiv:2101.03806v17 citations
Originality Incremental advance
AI Analysis

This work addresses the computational cost limitations for multi-domain image-to-image translation, which is relevant for applications with limited resources.

This paper tackles the problem of multi-domain image-to-image translation under computational constraints. They propose an adaptive graph structure that allows the network to select specific sub-networks at inference time, leading to better image generation with fewer artifacts compared to existing methods on facial and painting datasets, while maintaining an almost constant computational cost.

In this work, we address the problem of multi-domain image-to-image translation with particular attention paid to computational cost. In particular, current state of the art models require a large and deep model in order to handle the visual diversity of multiple domains. In a context of limited computational resources, increasing the network size may not be possible. Therefore, we propose to increase the network capacity by using an adaptive graph structure. At inference time, the network estimates its own graph by selecting specific sub-networks. Sub-network selection is implemented using Gumbel-Softmax in order to allow end-to-end training. This approach leads to an adjustable increase in number of parameters while preserving an almost constant computational cost. Our evaluation on two publicly available datasets of facial and painting images shows that our adaptive strategy generates better images with fewer artifacts than literature methods

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes