LGMLDec 19, 2013

Multimodal Transitions for Generative Stochastic Networks

arXiv:1312.5578v44 citations
Originality Incremental advance
AI Analysis

This work addresses a theoretical bottleneck in generative modeling for researchers, though it is incremental as it builds on existing GSN frameworks.

The paper tackled the limitation of unimodal transition operators in Generative Stochastic Networks (GSNs) by introducing multimodal transition distributions, specifically using NADE models, which improved sample generation capabilities as demonstrated in experiments.

Generative Stochastic Networks (GSNs) have been recently introduced as an alternative to traditional probabilistic modeling: instead of parametrizing the data distribution directly, one parametrizes a transition operator for a Markov chain whose stationary distribution is an estimator of the data generating distribution. The result of training is therefore a machine that generates samples through this Markov chain. However, the previously introduced GSN consistency theorems suggest that in order to capture a wide class of distributions, the transition operator in general should be multimodal, something that has not been done before this paper. We introduce for the first time multimodal transition distributions for GSNs, in particular using models in the NADE family (Neural Autoregressive Density Estimator) as output distributions of the transition operator. A NADE model is related to an RBM (and can thus model multimodal distributions) but its likelihood (and likelihood gradient) can be computed easily. The parameters of the NADE are obtained as a learned function of the previous state of the learned Markov chain. Experiments clearly illustrate the advantage of such multimodal transition distributions over unimodal GSNs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes