CVGRLGMar 14, 2022

InsetGAN for Full-Body Image Generation

arXiv:2203.07293v168 citationsh-index: 73
AI Analysis

This addresses the challenge of generating diverse and realistic full-body human images for applications in computer graphics and AI, representing an incremental improvement over existing GAN methods.

The paper tackled the problem of generating full-body human images by combining multiple pretrained GANs, such as one for the body and others for parts like faces, to produce plausible-looking results without seams, as demonstrated through quantitative metrics and user studies.

While GANs can produce photo-realistic images in ideal conditions for certain domains, the generation of full-body human images remains difficult due to the diversity of identities, hairstyles, clothing, and the variance in pose. Instead of modeling this complex domain with a single GAN, we propose a novel method to combine multiple pretrained GANs, where one GAN generates a global canvas (e.g., human body) and a set of specialized GANs, or insets, focus on different parts (e.g., faces, shoes) that can be seamlessly inserted onto the global canvas. We model the problem as jointly exploring the respective latent spaces such that the generated images can be combined, by inserting the parts from the specialized generators onto the global canvas, without introducing seams. We demonstrate the setup by combining a full body GAN with a dedicated high-quality face GAN to produce plausible-looking humans. We evaluate our results with quantitative metrics and user studies.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes