CVDec 2, 2021

GANSeg: Learning to Segment by Unsupervised Hierarchical Image Generation

arXiv:2112.01036v328 citations
Originality Highly original
AI Analysis

This addresses the expensive annotation problem in segmentation for vision tasks, offering a novel unsupervised approach that is not incremental.

The paper tackles the problem of unsupervised image segmentation by proposing a GAN-based method that generates images conditioned on latent masks, eliminating the need for annotations. It outperforms state-of-the-art unsupervised methods on established benchmarks.

Segmenting an image into its parts is a frequent preprocess for high-level vision tasks such as image editing. However, annotating masks for supervised training is expensive. Weakly-supervised and unsupervised methods exist, but they depend on the comparison of pairs of images, such as from multi-views, frames of videos, and image augmentation, which limits their applicability. To address this, we propose a GAN-based approach that generates images conditioned on latent masks, thereby alleviating full or weak annotations required in previous approaches. We show that such mask-conditioned image generation can be learned faithfully when conditioning the masks in a hierarchical manner on latent keypoints that define the position of parts explicitly. Without requiring supervision of masks or points, this strategy increases robustness to viewpoint and object positions changes. It also lets us generate image-mask pairs for training a segmentation network, which outperforms the state-of-the-art unsupervised segmentation methods on established benchmarks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes