LGMLOct 31, 2018

Mixture Density Generative Adversarial Networks

arXiv:1811.00152v242 citations
Originality Incremental advance
AI Analysis

This addresses the mode collapse issue in GANs for image generation, offering an incremental improvement over existing methods.

The paper tackles the mode collapse problem in Generative Adversarial Networks by proposing Mixture Density GAN, which encourages the Discriminator to form clusters in its embedding space to help the Generator discover different data modes, resulting in high-quality images with strong similarity to real images as measured by Fréchet Inception Distance.

Generative Adversarial Networks have surprising ability for generating sharp and realistic images, though they are known to suffer from the so-called mode collapse problem. In this paper, we propose a new GAN variant called Mixture Density GAN that while being capable of generating high-quality images, overcomes this problem by encouraging the Discriminator to form clusters in its embedding space, which in turn leads the Generator to exploit these and discover different modes in the data. This is achieved by positioning Gaussian density functions in the corners of a simplex, using the resulting Gaussian mixture as a likelihood function over discriminator embeddings, and formulating an objective function for GAN training that is based on these likelihoods. We demonstrate empirically (1) the quality of the generated images in Mixture Density GAN and their strong similarity to real images, as measured by the Fréchet Inception Distance (FID), which compares very favourably with state-of-the-art methods, and (2) the ability to avoid mode collapse and discover all data modes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes