CVAIAug 13, 2022

Self-supervised Matting-specific Portrait Enhancement and Generation

arXiv:2208.06601v14 citationsh-index: 30Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of expensive alpha matte annotation for researchers and practitioners in computer vision, though it is incremental as it builds on existing GAN and matting methods.

The paper tackles the ill-posed alpha matting problem by enhancing input portrait images to make alpha matte estimation easier for existing models, using GAN latent space optimization, and reports boosting automatic alpha matting performance by a large margin.

We resolve the ill-posed alpha matting problem from a completely different perspective. Given an input portrait image, instead of estimating the corresponding alpha matte, we focus on the other end, to subtly enhance this input so that the alpha matte can be easily estimated by any existing matting models. This is accomplished by exploring the latent space of GAN models. It is demonstrated that interpretable directions can be found in the latent space and they correspond to semantic image transformations. We further explore this property in alpha matting. Particularly, we invert an input portrait into the latent code of StyleGAN, and our aim is to discover whether there is an enhanced version in the latent space which is more compatible with a reference matting model. We optimize multi-scale latent vectors in the latent spaces under four tailored losses, ensuring matting-specificity and subtle modifications on the portrait. We demonstrate that the proposed method can refine real portrait images for arbitrary matting models, boosting the performance of automatic alpha matting by a large margin. In addition, we leverage the generative property of StyleGAN, and propose to generate enhanced portrait data which can be treated as the pseudo GT. It addresses the problem of expensive alpha matte annotation, further augmenting the matting performance of existing models. Code is available at~\url{https://github.com/cnnlstm/StyleGAN_Matting}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes