MLLGMay 21, 2018

Bayesian posterior approximation via greedy particle optimization

arXiv:1805.07912v323 citations
Originality Highly original
AI Analysis

This addresses the challenge of efficient and theoretically grounded posterior approximation in Bayesian inference, particularly for high-dimensional problems, offering a solution that balances computational speed with proven convergence guarantees.

The paper tackles the problem of approximating Bayesian posterior distributions for complex models by proposing a novel method, MMD-FW, which minimizes maximum mean discrepancy greedily using the Frank-Wolfe algorithm, achieving computational efficiency and a finite sample convergence bound in linear order in finite dimensions.

In Bayesian inference, the posterior distributions are difficult to obtain analytically for complex models such as neural networks. Variational inference usually uses a parametric distribution for approximation, from which we can easily draw samples. Recently discrete approximation by particles has attracted attention because of its high expression ability. An example is Stein variational gradient descent (SVGD), which iteratively optimizes particles. Although SVGD has been shown to be computationally efficient empirically, its theoretical properties have not been clarified yet and no finite sample bound of the convergence rate is known. Another example is the Stein points (SP) method, which minimizes kernelized Stein discrepancy directly. Although a finite sample bound is assured theoretically, SP is computationally inefficient empirically, especially in high-dimensional problems. In this paper, we propose a novel method named maximum mean discrepancy minimization by the Frank-Wolfe algorithm (MMD-FW), which minimizes MMD in a greedy way by the FW algorithm. Our method is computationally efficient empirically and we show that its finite sample convergence bound is in a linear order in finite dimensions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes