ST LG PRJun 1, 2022

Convergence of Stein Variational Gradient Descent under a Weaker Smoothness Condition

Lukang Sun, Avetik Karagulyan, Peter Richtarik

arXiv:2206.00508v111.724 citationsh-index: 67

Originality Incremental advance

AI Analysis

This work addresses a theoretical limitation for researchers in machine learning and statistics by extending SVGD analysis to a broader class of distributions, though it is incremental as it builds on existing smoothness relaxations.

The paper tackles the problem of sampling from probability distributions with non-smooth potentials, such as polynomials of degree greater than 2, by analyzing the convergence of Stein Variational Gradient Descent (SVGD) under a weaker smoothness condition, proving a descent lemma and a complexity bound in terms of Stein Fisher information.

Stein Variational Gradient Descent (SVGD) is an important alternative to the Langevin-type algorithms for sampling from probability distributions of the form $π(x) \propto \exp(-V(x))$. In the existing theory of Langevin-type algorithms and SVGD, the potential function $V$ is often assumed to be $L$-smooth. However, this restrictive condition excludes a large class of potential functions such as polynomials of degree greater than $2$. Our paper studies the convergence of the SVGD algorithm for distributions with $(L_0,L_1)$-smooth potentials. This relaxed smoothness assumption was introduced by Zhang et al. [2019a] for the analysis of gradient clipping algorithms. With the help of trajectory-independent auxiliary conditions, we provide a descent lemma establishing that the algorithm decreases the $\mathrm{KL}$ divergence at each iteration and prove a complexity bound for SVGD in the population limit in terms of the Stein Fisher information.

View on arXiv PDF

Similar