LG MLNov 19, 2015

Iterative Refinement of the Approximate Posterior for Directed Belief Networks

R Devon Hjelm, Kyunghyun Cho, Junyoung Chung, Russ Salakhutdinov, Vince Calhoun, Nebojsa Jojic

arXiv:1511.06382v612.515 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the problem of high variance in gradient estimates for researchers using variational inference in machine learning, but it is incremental as it builds on existing recognition network methods.

The paper tackles the limited capacity of recognition networks in variational inference for directed graphical models, which constrains generative model power and increases Monte Carlo variance, by introducing an iterative refinement procedure that yields competitive performance with state-of-the-art methods and increases effective sample size.

Variational methods that rely on a recognition network to approximate the posterior of directed graphical models offer better inference and learning than previous methods. Recent advances that exploit the capacity and flexibility in this approach have expanded what kinds of models can be trained. However, as a proposal for the posterior, the capacity of the recognition network is limited, which can constrain the representational power of the generative model and increase the variance of Monte Carlo estimates. To address these issues, we introduce an iterative refinement procedure for improving the approximate posterior of the recognition network and show that training with the refined posterior is competitive with state-of-the-art methods. The advantages of refinement are further evident in an increased effective sample size, which implies a lower variance of gradient estimates.

View on arXiv PDF Code

Similar