CVLGNENov 19, 2018

SEIGAN: Towards Compositional Image Generation by Simultaneously Learning to Segment, Enhance, and Inpaint

arXiv:1811.07630v224 citations
Originality Incremental advance
AI Analysis

This work addresses image manipulation and understanding for computer vision applications, representing an incremental advancement by integrating multiple operations into a single model.

The paper tackles the problem of compositional image generation by developing SEIGAN, a model that simultaneously learns to segment, enhance, and inpaint images, resulting in better generated images as evaluated by human assessors and high-quality segmentation masks that improve over other adversarially trained approaches.

We present a novel approach to image manipulation and understanding by simultaneously learning to segment object masks, paste objects to another background image, and remove them from original images. For this purpose, we develop a novel generative model for compositional image generation, SEIGAN (Segment-Enhance-Inpaint Generative Adversarial Network), which learns these three operations together in an adversarial architecture with additional cycle consistency losses. To train, SEIGAN needs only bounding box supervision and does not require pairing or ground truth masks. SEIGAN produces better generated images (evaluated by human assessors) than other approaches and produces high-quality segmentation masks, improving over other adversarially trained approaches and getting closer to the results of fully supervised training.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes