LG STAT-MECH AI COMP-PH MLFeb 12, 2025

Scalable Discrete Diffusion Samplers: Combinatorial Optimization and Statistical Physics

Sebastian Sanokowski, Wilhelm Berghammer, Martin Ennemoser, Haoyu Peter Wang, Sepp Hochreiter, Sebastian Lehner

arXiv:2502.08696v326.924 citationsh-index: 58ICLR

Originality Incremental advance

AI Analysis

This work addresses memory efficiency and unbiased sampling for discrete diffusion models, enabling broader scientific applications in statistical physics and combinatorial optimization, though it is incremental in improving existing methods.

The authors tackled the memory scaling limitations of discrete diffusion models by introducing two novel training methods, achieving state-of-the-art results in unsupervised combinatorial optimization and outperforming autoregressive approaches on Ising model benchmarks.

Learning to sample from complex unnormalized distributions over discrete domains emerged as a promising research direction with applications in statistical physics, variational inference, and combinatorial optimization. Recent work has demonstrated the potential of diffusion models in this domain. However, existing methods face limitations in memory scaling and thus the number of attainable diffusion steps since they require backpropagation through the entire generative process. To overcome these limitations we introduce two novel training methods for discrete diffusion samplers, one grounded in the policy gradient theorem and the other one leveraging Self-Normalized Neural Importance Sampling (SN-NIS). These methods yield memory-efficient training and achieve state-of-the-art results in unsupervised combinatorial optimization. Numerous scientific applications additionally require the ability of unbiased sampling. We introduce adaptations of SN-NIS and Neural Markov Chain Monte Carlo that enable for the first time the application of discrete diffusion models to this problem. We validate our methods on Ising model benchmarks and find that they outperform popular autoregressive approaches. Our work opens new avenues for applying diffusion models to a wide range of scientific applications in discrete domains that were hitherto restricted to exact likelihood models.

View on arXiv PDF

Similar