ML LGJan 28, 2019

Scalable Metropolis-Hastings for Exact Bayesian Inference with Large Datasets

Robert Cornish, Paul Vanetti, Alexandre Bouchard-Côté, George Deligiannidis, Arnaud Doucet

arXiv:1901.09881v39.922 citationsHas Code

Originality Highly original

AI Analysis

This addresses the scalability problem for practitioners using Bayesian inference with large datasets, offering an exact solution rather than an incremental improvement over existing approximations.

The authors tackled the computational inefficiency of Bayesian inference on large datasets by proposing the Scalable Metropolis-Hastings (SMH) kernel, which reduces the average cost per step from O(n) to O(1) or O(1/√n) while maintaining exact posterior sampling, as demonstrated by theoretical and empirical performance gains over standard methods.

Bayesian inference via standard Markov Chain Monte Carlo (MCMC) methods is too computationally intensive to handle large datasets, since the cost per step usually scales like $Θ(n)$ in the number of data points $n$. We propose the Scalable Metropolis-Hastings (SMH) kernel that exploits Gaussian concentration of the posterior to require processing on average only $O(1)$ or even $O(1/\sqrt{n})$ data points per step. This scheme is based on a combination of factorized acceptance probabilities, procedures for fast simulation of Bernoulli processes, and control variate ideas. Contrary to many MCMC subsampling schemes such as fixed step-size Stochastic Gradient Langevin Dynamics, our approach is exact insofar as the invariant distribution is the true posterior and not an approximation to it. We characterise the performance of our algorithm theoretically, and give realistic and verifiable conditions under which it is geometrically ergodic. This theory is borne out by empirical results that demonstrate overall performance benefits over standard Metropolis-Hastings and various subsampling algorithms.

View on arXiv PDF Code

Similar