CVDec 19, 2022

Out-of-domain GAN inversion via Invertibility Decomposition for Photo-Realistic Human Face Manipulation

arXiv:2212.09262v25 citationsh-index: 19
AI Analysis

This addresses fidelity issues in GAN inversion for real-world human face manipulation, but it is incremental as it builds on prior mask-based methods.

The paper tackles the problem of out-of-domain areas degrading fidelity in GAN inversion for human face images by proposing a framework that decomposes images into in-domain and out-of-domain partitions using invertibility masks, resulting in photo-realistic manipulation with demonstrated superiority over existing methods.

The fidelity of Generative Adversarial Networks (GAN) inversion is impeded by Out-Of-Domain (OOD) areas (e.g., background, accessories) in the image. Detecting the OOD areas beyond the generation ability of the pre-trained model and blending these regions with the input image can enhance fidelity. The "invertibility mask" figures out these OOD areas, and existing methods predict the mask with the reconstruction error. However, the estimated mask is usually inaccurate due to the influence of the reconstruction error in the In-Domain (ID) area. In this paper, we propose a novel framework that enhances the fidelity of human face inversion by designing a new module to decompose the input images to ID and OOD partitions with invertibility masks. Unlike previous works, our invertibility detector is simultaneously learned with a spatial alignment module. We iteratively align the generated features to the input geometry and reduce the reconstruction error in the ID regions. Thus, the OOD areas are more distinguishable and can be precisely predicted. Then, we improve the fidelity of our results by blending the OOD areas from the input image with the ID GAN inversion results. Our method produces photo-realistic results for real-world human face image inversion and manipulation. Extensive experiments demonstrate our method's superiority over existing methods in the quality of GAN inversion and attribute manipulation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes