CVIVFeb 22, 2023

Gradient Adjusting Networks for Domain Inversion

Meta AI
arXiv:2302.11413v11 citationsh-index: 63Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of manipulating real-world images with StyleGAN2 for users in computer vision and graphics, though it is incremental as it builds on existing inversion techniques.

The paper tackles the problem of inverting real-world images into StyleGAN2's latent space for editing by tuning the generator's weights, achieving almost perfect inversion while maintaining editing capabilities with a sizable performance gap over current state-of-the-art methods.

StyleGAN2 was demonstrated to be a powerful image generation engine that supports semantic editing. However, in order to manipulate a real-world image, one first needs to be able to retrieve its corresponding latent representation in StyleGAN's latent space that is decoded to an image as close as possible to the desired image. For many real-world images, a latent representation does not exist, which necessitates the tuning of the generator network. We present a per-image optimization method that tunes a StyleGAN2 generator such that it achieves a local edit to the generator's weights, resulting in almost perfect inversion, while still allowing image editing, by keeping the rest of the mapping between an input latent representation tensor and an output image relatively intact. The method is based on a one-shot training of a set of shallow update networks (aka. Gradient Modification Modules) that modify the layers of the generator. After training the Gradient Modification Modules, a modified generator is obtained by a single application of these networks to the original parameters, and the previous editing capabilities of the generator are maintained. Our experiments show a sizable gap in performance over the current state of the art in this very active domain. Our code is available at \url{https://github.com/sheffier/gani}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes