STLGPRCOMLMay 23, 2023

Towards Understanding the Dynamics of Gaussian-Stein Variational Gradient Descent

arXiv:2305.14076v418 citations
Originality Incremental advance
AI Analysis

This provides incremental theoretical insights for researchers in variational inference and sampling methods, addressing a known bottleneck in understanding SVGD dynamics.

The paper tackles the theoretical understanding of Stein Variational Gradient Descent (SVGD) by analyzing Gaussian-SVGD, showing linear convergence to the closest Gaussian in KL divergence for strongly log-concave targets and proposing a unified framework that includes recent algorithms, with a new particle-based instance empirically outperforming existing approaches.

Stein Variational Gradient Descent (SVGD) is a nonparametric particle-based deterministic sampling algorithm. Despite its wide usage, understanding the theoretical properties of SVGD has remained a challenging problem. For sampling from a Gaussian target, the SVGD dynamics with a bilinear kernel will remain Gaussian as long as the initializer is Gaussian. Inspired by this fact, we undertake a detailed theoretical study of the Gaussian-SVGD, i.e., SVGD projected to the family of Gaussian distributions via the bilinear kernel, or equivalently Gaussian variational inference (GVI) with SVGD. We present a complete picture by considering both the mean-field PDE and discrete particle systems. When the target is strongly log-concave, the mean-field Gaussian-SVGD dynamics is proven to converge linearly to the Gaussian distribution closest to the target in KL divergence. In the finite-particle setting, there is both uniform in time convergence to the mean-field limit and linear convergence in time to the equilibrium if the target is Gaussian. In the general case, we propose a density-based and a particle-based implementation of the Gaussian-SVGD, and show that several recent algorithms for GVI, proposed from different perspectives, emerge as special cases of our unified framework. Interestingly, one of the new particle-based instance from this framework empirically outperforms existing approaches. Our results make concrete contributions towards obtaining a deeper understanding of both SVGD and GVI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes