LG CV MLDec 17, 2018

Latent Dirichlet Allocation in Generative Adversarial Networks

Lili Pan, Shen Cheng, Jian Liu, Yazhou Ren, Zenglin Xu

arXiv:1812.06571v53.54 citations

Originality Incremental advance

AI Analysis

This addresses multimodal image generation for computer vision applications, but appears incremental as it builds on existing GAN methods.

The paper tackled multimodal image generation in GANs by introducing a Latent Dirichlet Allocation-based GAN (LDAGAN) that uses a Dirichlet prior to model underlying data structure, resulting in outstanding performance on real-world datasets.

We study the problem of multimodal generative modelling of images based on generative adversarial networks (GANs). Despite the success of existing methods, they often ignore the underlying structure of vision data or its multimodal generation characteristics. To address this problem, we introduce the Dirichlet prior for multimodal image generation, which leads to a new Latent Dirichlet Allocation based GAN (LDAGAN). In detail, for the generative process modelling, LDAGAN defines a generative mode for each sample, determining which generative sub-process it belongs to. For the adversarial training, LDAGAN derives a variational expectation-maximization (VEM) algorithm to estimate model parameters. Experimental results on real-world datasets have demonstrated the outstanding performance of LDAGAN over other existing GANs.

View on arXiv PDF

Similar