CV | Mar 11, 2019

Pluralistic Image Completion

arXiv:1903.04227v2 | 494 citations
Originality: Incremental advance
AI Analysis

This paper addresses the limitation of single-output image completion methods in applications that require varied plausible solutions, though its improvement in diversity over existing conditional VAE and GAN approaches is incremental.

The paper tackles the problem of generating multiple diverse completions for masked images, a task known as pluralistic image completion, and achieves higher-quality results with diverse outputs on datasets like Paris, CelebA-HQ, and ImageNet.

Most image completion methods produce only one result for each masked input, although there may be many reasonable possibilities. In this paper, we present an approach for pluralistic image completion, the task of generating multiple and diverse plausible solutions for image completion. A major challenge faced by learning-based approaches is that there is usually only one ground truth training instance per label. As such, sampling from conditional VAEs still leads to minimal diversity. To overcome this, we propose a novel and probabilistically principled framework with two parallel paths. One is a reconstructive path that uses the single given ground truth to obtain a prior distribution of the missing parts and rebuilds the original image from this distribution. The other is a generative path for which the conditional prior is coupled to the distribution obtained in the reconstructive path. Both are supported by GANs. We also introduce a new short+long term attention layer that exploits distant relations among decoder and encoder features, improving appearance consistency. When tested on datasets with buildings (Paris), faces (CelebA-HQ), and natural images (ImageNet), our method not only generated higher-quality completion results, but also produced multiple and diverse plausible outputs.
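
As a rough illustration of what "coupling" the conditional prior to the reconstructive-path distribution could look like, the sketch below computes a KL divergence between two diagonal Gaussians. This is an assumption for illustration only: the abstract does not state the distribution family or the exact coupling term, and the function name `kl_diag_gaussians` and its parameters are hypothetical, not taken from the paper or its code.

```python
import math

def kl_diag_gaussians(mu_q, logvar_q, mu_p, logvar_p):
    """KL(q || p) between two diagonal Gaussians.

    Assumption (not from the abstract): q stands in for the distribution
    inferred on the reconstructive path and p for the conditional prior
    used on the generative path, both parameterized by per-dimension
    means and log-variances.
    """
    total = 0.0
    for mq, lq, mp, lp in zip(mu_q, logvar_q, mu_p, logvar_p):
        # Per-dimension closed form:
        # 0.5 * (log(var_p / var_q) + (var_q + (mq - mp)^2) / var_p - 1)
        total += 0.5 * (lp - lq + (math.exp(lq) + (mq - mp) ** 2) / math.exp(lp) - 1.0)
    return total

# Identical distributions have zero divergence; shifting one mean increases it.
same = kl_diag_gaussians([0.0, 0.0], [0.0, 0.0], [0.0, 0.0], [0.0, 0.0])
shifted = kl_diag_gaussians([1.0], [0.0], [0.0], [0.0])
```

Minimizing a term like this during training would pull the two distributions together, which is one plausible reading of how samples drawn from the conditional prior at test time could remain both diverse and consistent with the visible region.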

Code Implementations: 1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
