LGMLFeb 5

Discrete diffusion samplers and bridges: Off-policy algorithms and applications in latent spaces

arXiv:2602.05961v12 citationsh-index: 29
Originality Incremental advance
AI Analysis

This work addresses a gap in discrete-space sampling for researchers in statistics and machine learning, offering incremental improvements and new applications.

The paper tackles the challenge of sampling from unnormalized discrete distributions by introducing off-policy training techniques for discrete diffusion samplers, showing improved performance on benchmarks, and extends them to bridging tasks and applications in latent spaces of image models.

Sampling from a distribution $p(x) \propto e^{-\mathcal{E}(x)}$ known up to a normalising constant is an important and challenging problem in statistics. Recent years have seen the rise of a new family of amortised sampling algorithms, commonly referred to as diffusion samplers, that enable fast and efficient sampling from an unnormalised density. Such algorithms have been widely studied for continuous-space sampling tasks; however, their application to problems in discrete space remains largely unexplored. Although some progress has been made in this area, discrete diffusion samplers do not take full advantage of ideas commonly used for continuous-space sampling. In this paper, we propose to bridge this gap by introducing off-policy training techniques for discrete diffusion samplers. We show that these techniques improve the performance of discrete samplers on both established and new synthetic benchmarks. Next, we generalise discrete diffusion samplers to the task of bridging between two arbitrary distributions, introducing data-to-energy Schrödinger bridge training for the discrete domain for the first time. Lastly, we showcase the application of the proposed diffusion samplers to data-free posterior sampling in the discrete latent spaces of image generative models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes