CVMar 30, 2022

CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs

arXiv:2203.16521v125 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of explicit correspondence extraction in generative models for computer vision applications, representing an incremental advancement in GAN-based techniques.

The paper tackles the problem of extracting dense pixel-level correspondences from GANs by introducing CoordGAN, a structure-texture disentangled model that learns correspondence maps for generated images, and it quantitatively demonstrates improved segmentation mask transfer and better disentanglement compared to existing methods.

Recent advances show that Generative Adversarial Networks (GANs) can synthesize images with smooth variations along semantically meaningful latent directions, such as pose, expression, layout, etc. While this indicates that GANs implicitly learn pixel-level correspondences across images, few studies explored how to extract them explicitly. In this work, we introduce Coordinate GAN (CoordGAN), a structure-texture disentangled GAN that learns a dense correspondence map for each generated image. We represent the correspondence maps of different images as warped coordinate frames transformed from a canonical coordinate frame, i.e., the correspondence map, which describes the structure (e.g., the shape of a face), is controlled via a transformation. Hence, finding correspondences boils down to locating the same coordinate in different correspondence maps. In CoordGAN, we sample a transformation to represent the structure of a synthesized instance, while an independent texture branch is responsible for rendering appearance details orthogonal to the structure. Our approach can also extract dense correspondence maps for real images by adding an encoder on top of the generator. We quantitatively demonstrate the quality of the learned dense correspondences through segmentation mask transfer on multiple datasets. We also show that the proposed generator achieves better structure and texture disentanglement compared to existing approaches. Project page: https://jitengmu.github.io/CoordGAN/

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes