LG CA NAJun 16, 2025

Fast Convergence for High-Order ODE Solvers in Diffusion Probabilistic Models

Daniel Zhengyu Huang, Jiaoyang Huang, Zhengjiang Lin

arXiv:2506.13061v318.89 citationsh-index: 5

Originality Incremental advance

AI Analysis

This provides a rigorous theoretical foundation for deterministic samplers in diffusion models, which is incremental but important for improving sampling efficiency in generative AI.

The paper tackles the problem of analyzing convergence for high-order ODE solvers in diffusion probabilistic models, proving a bound on the total variation distance between generated and target distributions as O(d^{7/4}ε_score^{1/2} + d(dH_max)^p), with numerical experiments confirming bounded score function derivatives.

Diffusion probabilistic models generate samples by learning to reverse a noise-injection process that transforms data into noise. A key development is the reformulation of the reverse sampling process as a deterministic probability flow ordinary differential equation (ODE), which allows for efficient sampling using high-order numerical solvers. Unlike traditional time integrator analysis, the accuracy of this sampling procedure depends not only on numerical integration errors but also on the approximation quality and regularity of the learned score function, as well as their interaction. In this work, we present a rigorous convergence analysis of deterministic samplers derived from probability flow ODEs for general forward processes with arbitrary variance schedules. Specifically, we develop and analyze $p$-th order (exponential) Runge-Kutta schemes, under the practical assumption that the first and second derivatives of the learned score function are bounded. We prove that the total variation distance between the generated and target distributions can be bounded as \begin{align*} O\bigl(d^{\frac{7}{4}}\varepsilon_{\text{score}}^{\frac{1}{2}} +d(dH_{\max})^p\bigr), \end{align*} where $\varepsilon^2_{\text{score}}$ denotes the $L^2$ error in the score function approximation, $d$ is the data dimension, and $H_{\max}$ represents the maximum solver step size. Numerical experiments on benchmark datasets further confirm that the derivatives of the learned score function are bounded in practice.

View on arXiv PDF

Similar