MLLGNov 7, 2018

Wasserstein variational gradient descent: From semi-discrete optimal transport to ensemble variational inference

arXiv:1811.02827v214 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of flexible posterior approximation in machine learning, offering a novel approach that could improve accuracy in Bayesian inference tasks, though it appears incremental as it builds on existing particle-based and optimal transport methods.

The paper tackles the problem of approximating complex posterior distributions in variational inference by introducing a new particle-based method that minimizes a semi-discrete optimal transport divergence instead of the KL divergence, resulting in an algorithm that provides both particle approximations and optimal transportation densities for mapping particles to posterior segments.

Particle-based variational inference offers a flexible way of approximating complex posterior distributions with a set of particles. In this paper we introduce a new particle-based variational inference method based on the theory of semi-discrete optimal transport. Instead of minimizing the KL divergence between the posterior and the variational approximation, we minimize a semi-discrete optimal transport divergence. The solution of the resulting optimal transport problem provides both a particle approximation and a set of optimal transportation densities that map each particle to a segment of the posterior distribution. We approximate these transportation densities by minimizing the KL divergence between a truncated distribution and the optimal transport solution. The resulting algorithm can be interpreted as a form of ensemble variational inference where each particle is associated with a local variational approximation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes