CVJul 4, 2022

Memory Efficient Patch-based Training for INR-based GANs

arXiv:2207.01395v22 citationsh-index: 12
Originality Incremental advance
AI Analysis

This addresses a computational bottleneck for researchers and practitioners using INR-based GANs in applications like super-resolution and outpainting, though it is an incremental improvement on existing methods.

The paper tackles the high computational cost of training implicit neural representation (INR)-based GANs, which scales with image resolution, by proposing a multi-stage patch-based training method that reduces GPU memory usage while maintaining competitive FID scores.

Recent studies have shown remarkable progress in GANs based on implicit neural representation (INR) - an MLP that produces an RGB value given its (x, y) coordinate. They represent an image as a continuous version of the underlying 2D signal instead of a 2D array of pixels, which opens new horizons for GAN applications (e.g., zero-shot super-resolution, image outpainting). However, training existing approaches require a heavy computational cost proportional to the image resolution, since they compute an MLP operation for every (x, y) coordinate. To alleviate this issue, we propose a multi-stage patch-based training, a novel and scalable approach that can train INR-based GANs with a flexible computational cost regardless of the image resolution. Specifically, our method allows to generate and discriminate by patch to learn the local details of the image and learn global structural information by a novel reconstruction loss to enable efficient GAN training. We conduct experiments on several benchmark datasets to demonstrate that our approach enhances baseline models in GPU memory while maintaining FIDs at a reasonable level.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes