LG MLJun 20, 2017

A Divergence Bound for Hybrids of MCMC and Variational Inference and an Application to Langevin Dynamics and SGVI

arXiv:1706.06529v16.910 citations

Originality Incremental advance

AI Analysis

This work addresses the efficiency gap in approximate inference methods for machine learning practitioners, offering a hybrid approach that is incremental in nature.

The paper tackles the trade-off between MCMC and variational inference by deriving a distribution over variational parameters to minimize divergence from the target distribution, with an example application to Langevin dynamics and SGVI.

Two popular classes of methods for approximate inference are Markov chain Monte Carlo (MCMC) and variational inference. MCMC tends to be accurate if run for a long enough time, while variational inference tends to give better approximations at shorter time horizons. However, the amount of time needed for MCMC to exceed the performance of variational methods can be quite high, motivating more fine-grained tradeoffs. This paper derives a distribution over variational parameters, designed to minimize a bound on the divergence between the resulting marginal distribution and the target, and gives an example of how to sample from this distribution in a way that interpolates between the behavior of existing methods based on Langevin dynamics and stochastic gradient variational inference (SGVI).

View on arXiv PDF

Similar