ST CO ME MLOct 9, 2017

$α$-Variational Inference with Statistical Guarantees

Yun Yang, Debdeep Pati, Anirban Bhattacharya

arXiv:1710.03266v215.228 citations

Originality Highly original

AI Analysis

This provides a theoretical foundation for variational inference with convergence guarantees, addressing a key bottleneck in Bayesian computation for practitioners.

The authors introduced α-VB, a family of variational approximations to Bayesian posteriors with provable statistical guarantees, showing that maximizing the evidence lower bound minimizes Bayes risk and that point estimates converge at an optimal rate in various models.

We propose a family of variational approximations to Bayesian posterior distributions, called $α$-VB, with provable statistical guarantees. The standard variational approximation is a special case of $α$-VB with $α=1$. When $α\in(0,1]$, a novel class of variational inequalities are developed for linking the Bayes risk under the variational approximation to the objective function in the variational optimization problem, implying that maximizing the evidence lower bound in variational inference has the effect of minimizing the Bayes risk within the variational density family. Operating in a frequentist setup, the variational inequalities imply that point estimates constructed from the $α$-VB procedure converge at an optimal rate to the true parameter in a wide range of problems. We illustrate our general theory with a number of examples, including the mean-field variational approximation to (low)-high-dimensional Bayesian linear regression with spike and slab priors, mixture of Gaussian models, latent Dirichlet allocation, and (mixture of) Gaussian variational approximation in regular parametric models.

View on arXiv PDF

Similar