STLGPRJun 1, 2022

Convergence of Stein Variational Gradient Descent under a Weaker Smoothness Condition

arXiv:2206.00508v124 citationsh-index: 67
Originality Incremental advance
AI Analysis

This work addresses a theoretical limitation for researchers in machine learning and statistics by extending SVGD analysis to a broader class of distributions, though it is incremental as it builds on existing smoothness relaxations.

The paper tackles the problem of sampling from probability distributions with non-smooth potentials, such as polynomials of degree greater than 2, by analyzing the convergence of Stein Variational Gradient Descent (SVGD) under a weaker smoothness condition, proving a descent lemma and a complexity bound in terms of Stein Fisher information.

Stein Variational Gradient Descent (SVGD) is an important alternative to the Langevin-type algorithms for sampling from probability distributions of the form $π(x) \propto \exp(-V(x))$. In the existing theory of Langevin-type algorithms and SVGD, the potential function $V$ is often assumed to be $L$-smooth. However, this restrictive condition excludes a large class of potential functions such as polynomials of degree greater than $2$. Our paper studies the convergence of the SVGD algorithm for distributions with $(L_0,L_1)$-smooth potentials. This relaxed smoothness assumption was introduced by Zhang et al. [2019a] for the analysis of gradient clipping algorithms. With the help of trajectory-independent auxiliary conditions, we provide a descent lemma establishing that the algorithm decreases the $\mathrm{KL}$ divergence at each iteration and prove a complexity bound for SVGD in the population limit in terms of the Stein Fisher information.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes