CV · Feb 7, 2023

High-Resolution GAN Inversion for Degraded Images in Large Diverse Datasets

Tencent
arXiv:2302.03406v1 · 9 citations · h-index: 23 · Has Code
Originality: Incremental advance
AI Analysis

This work addresses the need for a generic method to restore corrupted images, which matters for perception and for downstream computer vision applications. The contribution appears incremental, since it builds on existing GAN inversion techniques.

The paper tackles the problem of generating high-quality images from degraded inputs in large diverse datasets by proposing a novel GAN inversion framework using StyleGAN-XL, achieving preferable quantitative and qualitative results across multiple restoration tasks such as inpainting, colorization, and super-resolution.

The last decades are marked by massive and diverse image data, which shows increasingly high resolution and quality. However, some images we obtain may be corrupted, affecting perception and the application of downstream tasks. A generic method for generating a high-quality image from a degraded one is in demand. In this paper, we present a novel GAN inversion framework that utilizes the powerful generative ability of StyleGAN-XL for this problem. To ease the inversion challenge with StyleGAN-XL, Clustering & Regularize Inversion (CRI) is proposed. Specifically, the latent space is first divided into finer-grained sub-spaces by clustering. Instead of initializing the inversion with the average latent vector, we approximate a centroid latent vector from the clusters, which generates an image close to the input image. Then, an offset with a regularization term is introduced to keep the inverted latent vector within a certain range. We validate our CRI scheme on multiple restoration tasks (i.e., inpainting, colorization, and super-resolution) of complex natural images, and show preferable quantitative and qualitative results. We further demonstrate that our technique is robust in terms of data and different GAN models. To the best of our knowledge, we are the first to adopt StyleGAN-XL for generating high-quality natural images from diverse degraded inputs. Code is available at https://github.com/Booooooooooo/CRI.
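
The two steps the abstract describes (initialize from a cluster centroid, then optimize a regularized offset) can be illustrated with a toy sketch. This is not the authors' implementation: a linear map `G` stands in for StyleGAN-XL, plain k-means stands in for the paper's clustering, the nearest centroid is simply selected rather than approximated, and the names `kmeans`, `invert`, `mask`, and `lam` are all hypothetical. The degradation is assumed to be a known binary pixel mask, as in inpainting.

```python
import numpy as np

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means over sampled latent vectors; returns k centroids."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(iters):
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[j] = members.mean(axis=0)
    return centroids

def invert(target, mask, G, latents, k=8, steps=200, lr=0.05, lam=0.1):
    """Toy inversion. G is an (image_dim, latent_dim) matrix: image = G @ w.

    Minimizes ||mask * (G @ (w0 + offset) - target)||^2 + lam * ||offset||^2,
    where w0 is the centroid whose image best matches the observed pixels.
    """
    centroids = kmeans(latents, k)
    # Step 1 (assumed simplification): start from the best-matching centroid
    # instead of the global average latent vector.
    errors = [np.sum((mask * (G @ c - target)) ** 2) for c in centroids]
    w0 = centroids[int(np.argmin(errors))]
    # Step 2: gradient descent on an offset, with an L2 penalty that keeps
    # the inverted latent close to the chosen centroid.
    offset = np.zeros_like(w0)
    for _ in range(steps):
        residual = mask * (G @ (w0 + offset) - target)
        grad = 2.0 * (G.T @ residual) + 2.0 * lam * offset
        offset -= lr * grad
    return w0, offset
```

Calling `invert(target, mask, G, latents)` returns the starting centroid and the learned offset; the restored image in this toy setting is `G @ (w0 + offset)`. A larger `lam` keeps the result closer to the centroid at the cost of a looser fit to the observed pixels.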

Code Implementations: 1 repo
