STLGJun 28, 2017

Concentration of tempered posteriors and of their variational approximations

arXiv:1706.09293v3141 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of scaling Bayesian inference for large-scale applications in statistics and machine learning, providing theoretical guarantees for variational approximations, which is incremental but important for practical use.

The paper tackles the challenge of applying Bayesian methods to massive datasets by analyzing the theoretical properties of variational approximations of fractional posteriors, showing concentration results with concrete rates in examples like matrix completion and Gaussian VB.

While Bayesian methods are extremely popular in statistics and machine learning, their application to massive datasets is often challenging, when possible at all. Indeed, the classical MCMC algorithms are prohibitively slow when both the model dimension and the sample size are large. Variational Bayesian methods aim at approximating the posterior by a distribution in a tractable family. Thus, MCMC are replaced by an optimization algorithm which is orders of magnitude faster. VB methods have been applied in such computationally demanding applications as including collaborative filtering, image and video processing, NLP and text processing... However, despite very nice results in practice, the theoretical properties of these approximations are usually not known. In this paper, we propose a general approach to prove the concentration of variational approximations of fractional posteriors. We apply our theory to two examples: matrix completion, and Gaussian VB.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes