LGCVMLDec 17, 2018

Latent Dirichlet Allocation in Generative Adversarial Networks

arXiv:1812.06571v54 citations
Originality Incremental advance
AI Analysis

This addresses multimodal image generation for computer vision applications, but appears incremental as it builds on existing GAN methods.

The paper tackled multimodal image generation in GANs by introducing a Latent Dirichlet Allocation-based GAN (LDAGAN) that uses a Dirichlet prior to model underlying data structure, resulting in outstanding performance on real-world datasets.

We study the problem of multimodal generative modelling of images based on generative adversarial networks (GANs). Despite the success of existing methods, they often ignore the underlying structure of vision data or its multimodal generation characteristics. To address this problem, we introduce the Dirichlet prior for multimodal image generation, which leads to a new Latent Dirichlet Allocation based GAN (LDAGAN). In detail, for the generative process modelling, LDAGAN defines a generative mode for each sample, determining which generative sub-process it belongs to. For the adversarial training, LDAGAN derives a variational expectation-maximization (VEM) algorithm to estimate model parameters. Experimental results on real-world datasets have demonstrated the outstanding performance of LDAGAN over other existing GANs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes