MLLGCOMar 11, 2022

Bayesian inference via sparse Hamiltonian flows

arXiv:2203.05723v20.2214 citationsh-index: 20
AI Analysis55

This work addresses computational bottlenecks in Bayesian inference for data-intensive applications, representing an incremental improvement over prior coreset methods.

The paper tackles the challenge of efficiently constructing Bayesian coresets for faster inference by introducing sparse Hamiltonian flows, which achieve accurate posterior approximations with significantly reduced runtime compared to existing methods.

A Bayesian coreset is a small, weighted subset of data that replaces the full dataset during Bayesian inference, with the goal of reducing computational cost. Although past work has shown empirically that there often exists a coreset with low inferential error, efficiently constructing such a coreset remains a challenge. Current methods tend to be slow, require a secondary inference step after coreset construction, and do not provide bounds on the data marginal evidence. In this work, we introduce a new method -- sparse Hamiltonian flows -- that addresses all three of these challenges. The method involves first subsampling the data uniformly, and then optimizing a Hamiltonian flow parametrized by coreset weights and including periodic momentum quasi-refreshment steps. Theoretical results show that the method enables an exponential compression of the dataset in a representative model, and that the quasi-refreshment steps reduce the KL divergence to the target. Real and synthetic experiments demonstrate that sparse Hamiltonian flows provide accurate posterior approximations with significantly reduced runtime compared with competing dynamical-system-based inference methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes