CVNov 4, 2022

Contrastive Learning for Diverse Disentangled Foreground Generation

arXiv:2211.02707v18 citationsh-index: 73
Originality Incremental advance
AI Analysis

This addresses the limitation of existing image inpainting methods in producing diverse and controllable foreground generation, which is incremental but useful for applications like face editing.

The paper tackles the problem of generating diverse foreground images with explicit control over specific factors of variation, such as facial identity or expression, by introducing a contrastive learning method with latent codes that achieves superior diversity and controllability compared to state-of-the-art methods.

We introduce a new method for diverse foreground generation with explicit control over various factors. Existing image inpainting based foreground generation methods often struggle to generate diverse results and rarely allow users to explicitly control specific factors of variation (e.g., varying the facial identity or expression for face inpainting results). We leverage contrastive learning with latent codes to generate diverse foreground results for the same masked input. Specifically, we define two sets of latent codes, where one controls a pre-defined factor (``known''), and the other controls the remaining factors (``unknown''). The sampled latent codes from the two sets jointly bi-modulate the convolution kernels to guide the generator to synthesize diverse results. Experiments demonstrate the superiority of our method over state-of-the-arts in result diversity and generation controllability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes