LGAICVOct 28, 2021

Multi-Attribute Balanced Sampling for Disentangled GAN Controls

arXiv:2111.00909v210 citations
Originality Incremental advance
AI Analysis

This addresses the issue of biased and entangled semantic controls in GANs for researchers and practitioners in generative modeling, though it is incremental as it builds on existing classifier-based approaches.

The paper tackles the problem of entangled attribute edits in GAN-generated images by proposing a balanced sampling method to remove over-represented co-occurring attributes before training classifiers, showing it outperforms state-of-the-art methods on face manipulation tasks with PGGAN and StyleGAN on CelebAHQ and FFHQ datasets.

Various controls over the generated data can be extracted from the latent space of a pre-trained GAN, as it implicitly encodes the semantics of the training data. The discovered controls allow to vary semantic attributes in the generated images but usually lead to entangled edits that affect multiple attributes at the same time. Supervised approaches typically sample and annotate a collection of latent codes, then train classifiers in the latent space to identify the controls. Since the data generated by GANs reflects the biases of the original dataset, so do the resulting semantic controls. We propose to address disentanglement by subsampling the generated data to remove over-represented co-occuring attributes thus balancing the semantics of the dataset before training the classifiers. We demonstrate the effectiveness of this approach by extracting disentangled linear directions for face manipulation on two popular GAN architectures, PGGAN and StyleGAN, and two datasets, CelebAHQ and FFHQ. We show that this approach outperforms state-of-the-art classifier-based methods while avoiding the need for disentanglement-enforcing post-processing.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes