LG ST MLJul 11, 2025

Beyond Scores: Proximal Diffusion Models

Zhenghan Fang, Mateo Díaz, Sam Buchanan, Jeremias Sulam

arXiv:2507.08956v113.05 citationsh-index: 3

Originality Highly original

AI Analysis

This work addresses the sampling efficiency bottleneck in generative modeling for high-dimensional data, offering a novel method with incremental improvements over existing score-based approaches.

The paper tackles the problem of slow sampling in diffusion models by proposing Proximal Diffusion Models (ProxDM), which use proximal maps instead of scores, resulting in theoretical guarantees of O(d/√ε) steps for ε-accuracy and empirical demonstrations of significantly faster convergence in few sampling steps.

Diffusion models have quickly become some of the most popular and powerful generative models for high-dimensional data. The key insight that enabled their development was the realization that access to the score -- the gradient of the log-density at different noise levels -- allows for sampling from data distributions by solving a reverse-time stochastic differential equation (SDE) via forward discretization, and that popular denoisers allow for unbiased estimators of this score. In this paper, we demonstrate that an alternative, backward discretization of these SDEs, using proximal maps in place of the score, leads to theoretical and practical benefits. We leverage recent results in proximal matching to learn proximal operators of the log-density and, with them, develop Proximal Diffusion Models (ProxDM). Theoretically, we prove that $\widetilde{O}(d/\sqrt{\varepsilon})$ steps suffice for the resulting discretization to generate an $\varepsilon$-accurate distribution w.r.t. the KL divergence. Empirically, we show that two variants of ProxDM achieve significantly faster convergence within just a few sampling steps compared to conventional score-matching methods.

View on arXiv PDF

Similar