CVIVAug 31, 2023

Robust GAN inversion

arXiv:2308.16510v1h-index: 51
Originality Incremental advance
AI Analysis

This addresses the problem of balancing low distortion and high editability in GAN inversion for image editing, though it appears incremental as it builds on existing StyleGAN models.

The paper tackles the challenge of GAN inversion for real image editing by proposing a method that works in the native latent space W and tunes the generator to restore details, achieving the lowest distortion with 4 times fewer parameters.

Recent advancements in real image editing have been attributed to the exploration of Generative Adversarial Networks (GANs) latent space. However, the main challenge of this procedure is GAN inversion, which aims to map the image to the latent space accurately. Existing methods that work on extended latent space $W+$ are unable to achieve low distortion and high editability simultaneously. To address this issue, we propose an approach which works in native latent space $W$ and tunes the generator network to restore missing image details. We introduce a novel regularization strategy with learnable coefficients obtained by training randomized StyleGAN 2 model - WRanGAN. This method outperforms traditional approaches in terms of reconstruction quality and computational efficiency, achieving the lowest distortion with 4 times fewer parameters. Furthermore, we observe a slight improvement in the quality of constructing hyperplanes corresponding to binary image attributes. We demonstrate the effectiveness of our approach on two complex datasets: Flickr-Faces-HQ and LSUN Church.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes