MLAILGDec 10, 2024

Sequential Controlled Langevin Diffusions

arXiv:2412.07081v246 citationsh-index: 19ICLR
Originality Highly original
AI Analysis

This work addresses sampling challenges in machine learning and statistics by integrating complementary approaches, offering a more efficient solution for practitioners dealing with complex distributions.

The paper tackles the problem of sampling from unnormalized densities by combining Sequential Monte Carlo (SMC) and diffusion-based samplers into a new method called Sequential Controlled Langevin Diffusion (SCLD), which achieves improved performance on multiple benchmarks, often using only 10% of the training budget of previous diffusion-based samplers.

An effective approach for sampling from unnormalized densities is based on the idea of gradually transporting samples from an easy prior to the complicated target distribution. Two popular methods are (1) Sequential Monte Carlo (SMC), where the transport is performed through successive annealed densities via prescribed Markov chains and resampling steps, and (2) recently developed diffusion-based sampling methods, where a learned dynamical transport is used. Despite the common goal, both approaches have different, often complementary, advantages and drawbacks. The resampling steps in SMC allow focusing on promising regions of the space, often leading to robust performance. While the algorithm enjoys asymptotic guarantees, the lack of flexible, learnable transitions can lead to slow convergence. On the other hand, diffusion-based samplers are learned and can potentially better adapt themselves to the target at hand, yet often suffer from training instabilities. In this work, we present a principled framework for combining SMC with diffusion-based samplers by viewing both methods in continuous time and considering measures on path space. This culminates in the new Sequential Controlled Langevin Diffusion (SCLD) sampling method, which is able to utilize the benefits of both methods and reaches improved performance on multiple benchmark problems, in many cases using only 10% of the training budget of previous diffusion-based samplers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes