CVLGOct 19, 2021

Momentum Contrastive Autoencoder: Using Contrastive Learning for Latent Space Distribution Matching in WAE

arXiv:2110.10303v21 citations
Originality Incremental advance
AI Analysis

This work addresses a core problem in generative modeling for researchers, offering an incremental improvement over existing WAE methods.

The paper tackles the challenge of latent space distribution matching in Wasserstein autoencoders by using contrastive learning to optimize the loss, achieving faster convergence and more stable optimization, with improved FID scores on CelebA and CIFAR-10 datasets and realistic image quality on CelebA-HQ.

Wasserstein autoencoder (WAE) shows that matching two distributions is equivalent to minimizing a simple autoencoder (AE) loss under the constraint that the latent space of this AE matches a pre-specified prior distribution. This latent space distribution matching is a core component of WAE, and a challenging task. In this paper, we propose to use the contrastive learning framework that has been shown to be effective for self-supervised representation learning, as a means to resolve this problem. We do so by exploiting the fact that contrastive learning objectives optimize the latent space distribution to be uniform over the unit hyper-sphere, which can be easily sampled from. We show that using the contrastive learning framework to optimize the WAE loss achieves faster convergence and more stable optimization compared with existing popular algorithms for WAE. This is also reflected in the FID scores on CelebA and CIFAR-10 datasets, and the realistic generated image quality on the CelebA-HQ dataset.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes