LG MLJun 14, 2019

InfoGAN-CR and ModelCentrality: Self-supervised Model Training and Selection for Disentangling GANs

Zinan Lin, Kiran Koshy Thekumparampil, Giulia Fanti, Sewoong Oh

arXiv:1906.06034v319.546 citationsh-index: 37Has Code

Originality Highly original

AI Analysis

This work addresses the problem of training and selecting disentangled GANs for researchers in generative modeling, offering a self-supervised approach that improves performance without supervised data.

The paper tackles the challenge of training disentangled generative adversarial networks (GANs) by introducing self-supervised methods, achieving higher disentanglement scores than state-of-the-art VAE- and GAN-based approaches. It also proposes an unsupervised model selection scheme that finds models close to the best ones without requiring labeled data.

Disentangled generative models map a latent code vector to a target space, while enforcing that a subset of the learned latent codes are interpretable and associated with distinct properties of the target distribution. Recent advances have been dominated by Variational AutoEncoder (VAE)-based methods, while training disentangled generative adversarial networks (GANs) remains challenging. In this work, we show that the dominant challenges facing disentangled GANs can be mitigated through the use of self-supervision. We make two main contributions: first, we design a novel approach for training disentangled GANs with self-supervision. We propose contrastive regularizer, which is inspired by a natural notion of disentanglement: latent traversal. This achieves higher disentanglement scores than state-of-the-art VAE- and GAN-based approaches. Second, we propose an unsupervised model selection scheme called ModelCentrality, which uses generated synthetic samples to compute the medoid (multi-dimensional generalization of median) of a collection of models. The current common practice of hyper-parameter tuning requires using ground-truths samples, each labelled with known perfect disentangled latent codes. As real datasets are not equipped with such labels, we propose an unsupervised model selection scheme and show that it finds a model close to the best one, for both VAEs and GANs. Combining contrastive regularization with ModelCentrality, we improve upon the state-of-the-art disentanglement scores significantly, without accessing the supervised data.

View on arXiv PDF Code

Similar