The critical slowing down in diffusion models

arXiv:2605.1259721.7
Predicted impact top 37% in DIS-NN · last 90 daysOriginality Highly original
AI Analysis

Provides theoretical insight into the limitations of diffusion models near criticality and demonstrates how architectural design can overcome these bottlenecks, relevant for statistical physics and generative modeling.

The paper analyzes diffusion models in the Gaussian limit of the O(n) model, showing that training a one-layer score model exhibits critical slowing down, which is overcome by a two-layer architecture that reduces training time scaling from quadratic to logarithmic in system size.

Computational sampling has been central to the sciences since the mid-20th century. While machine-learning-based approaches have recently enabled major advances, their behavior remains poorly understood, with limited theoretical control over when and why they succeed. Here we provide such insight for diffusion models-a class of generative schemes highly effective in practice-by analyzing their application to the $O(n)$ model of statistical field theory in the Gaussian limit $n \to \infty$. In this analytically tractable setting, we show that training a score model with a one-layer network architecture matching the exact solution exhibits a form of critical slowing down in parameter learning. This slowing down also impacts the generation process, indicating that the well-known difficulties of sampling near criticality persist even for learned generative models. To overcome this bottleneck, we demonstrate the power of combining architectural depth with physical locality. We find that using a two-layer architecture drastically reduces the critical slowing down, with the training time scaling logarithmically rather than quadratically with system size. By introducing a local score approximation we show that this acceleration in training time can be achieved without increasing the number of neural network parameters. Taken together, these results demonstrate that diffusion models can overcome the critical slowing down through appropriate architectural design, and establish a controlled framework for understanding and improving learned sampling methods in statistical physics and beyond.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes