CVDec 31, 2019

OneGAN: Simultaneous Unsupervised Learning of Conditional Image Generation, Foreground Segmentation, and Fine-Grained Clustering

arXiv:1912.13471v252 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of reducing annotation costs for computer vision tasks, though it is incremental as it builds on existing GAN and VAE frameworks.

The paper tackles the problem of unsupervised learning for multiple image-related tasks, including conditional generation, segmentation, clustering, and object removal, achieving state-of-the-art results across all tasks without using annotations.

We present a method for simultaneously learning, in an unsupervised manner, (i) a conditional image generator, (ii) foreground extraction and segmentation, (iii) clustering into a two-level class hierarchy, and (iv) object removal and background completion, all done without any use of annotation. The method combines a Generative Adversarial Network and a Variational Auto-Encoder, with multiple encoders, generators and discriminators, and benefits from solving all tasks at once. The input to the training scheme is a varied collection of unlabeled images from the same domain, as well as a set of background images without a foreground object. In addition, the image generator can mix the background from one image, with a foreground that is conditioned either on that of a second image or on the index of a desired cluster. The method obtains state of the art results in comparison to the literature methods, when compared to the current state of the art in each of the tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes