ST LG PR MLJun 5, 2025

kTULA: A Langevin sampling algorithm with improved KL bounds under super-linear log-gradients

Iosif Lytras, Sotirios Sabanis, Ying Zhang

arXiv:2506.04878v12.32 citationsh-index: 20

Originality Highly original

AI Analysis

This addresses a key bottleneck in deep learning applications where global Lipschitz continuity fails, offering improved theoretical guarantees for sampling and optimization.

The paper tackles the problem of sampling from distributions with super-linearly growing log-gradients, common in deep learning, by proposing the kTULA algorithm, which achieves a non-asymptotic convergence bound in KL divergence with a rate of 2-ε, significantly improving over existing results.

Motivated by applications in deep learning, where the global Lipschitz continuity condition is often not satisfied, we examine the problem of sampling from distributions with super-linearly growing log-gradients. We propose a novel tamed Langevin dynamics-based algorithm, called kTULA, to solve the aforementioned sampling problem, and provide a theoretical guarantee for its performance. More precisely, we establish a non-asymptotic convergence bound in Kullback-Leibler (KL) divergence with the best-known rate of convergence equal to $2-\overlineε$, $\overlineε>0$, which significantly improves relevant results in existing literature. This enables us to obtain an improved non-asymptotic error bound in Wasserstein-2 distance, which can be used to further derive a non-asymptotic guarantee for kTULA to solve the associated optimization problems. To illustrate the applicability of kTULA, we apply the proposed algorithm to the problem of sampling from a high-dimensional double-well potential distribution and to an optimization problem involving a neural network. We show that our main results can be used to provide theoretical guarantees for the performance of kTULA.

View on arXiv PDF

Similar