CVLGNov 1, 2021

Projected GANs Converge Faster

arXiv:2111.01007v1300 citations
Originality Highly original
AI Analysis

This addresses the problem of inefficient and resource-intensive GAN training for researchers and practitioners in computer vision, offering a significant speed-up while maintaining or improving image quality.

The paper tackles the challenges of training Generative Adversarial Networks (GANs), such as slow convergence and high computational costs, by projecting samples into a pretrained feature space and mixing features across channels and resolutions, resulting in Projected GANs that match previous best Fréchet Inception Distance (FID) scores up to 40 times faster, reducing wall-clock time from 5 days to less than 3 hours.

Generative Adversarial Networks (GANs) produce high-quality images but are challenging to train. They need careful regularization, vast amounts of compute, and expensive hyper-parameter sweeps. We make significant headway on these issues by projecting generated and real samples into a fixed, pretrained feature space. Motivated by the finding that the discriminator cannot fully exploit features from deeper layers of the pretrained model, we propose a more effective strategy that mixes features across channels and resolutions. Our Projected GAN improves image quality, sample efficiency, and convergence speed. It is further compatible with resolutions of up to one Megapixel and advances the state-of-the-art Fréchet Inception Distance (FID) on twenty-two benchmark datasets. Importantly, Projected GANs match the previously lowest FIDs up to 40 times faster, cutting the wall-clock time from 5 days to less than 3 hours given the same computational resources.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes