LGMay 24, 2024

Learning to Discretize Denoising Diffusion ODEs

arXiv:2405.15506v428 citationsh-index: 41Has CodeICLR
Originality Incremental advance
AI Analysis

This work addresses the sampling efficiency problem for users of diffusion models, offering an incremental improvement by optimizing discretization without retraining.

The paper tackles the high computational cost of sampling from pre-trained diffusion models by proposing LD3, a lightweight framework that learns optimal time discretization to reduce neural function evaluations while preserving quality, achieving FIDs of 2.38 and 2.27 on CIFAR10 and AFHQv2 with 10 NFEs.

Diffusion Probabilistic Models (DPMs) are generative models showing competitive performance in various domains, including image synthesis and 3D point cloud generation. Sampling from pre-trained DPMs involves multiple neural function evaluations (NFEs) to transform Gaussian noise samples into images, resulting in higher computational costs compared to single-step generative models such as GANs or VAEs. Therefore, reducing the number of NFEs while preserving generation quality is crucial. To address this, we propose LD3, a lightweight framework designed to learn the optimal time discretization for sampling. LD3 can be combined with various samplers and consistently improves generation quality without having to retrain resource-intensive neural networks. We demonstrate analytically and empirically that LD3 improves sampling efficiency with much less computational overhead. We evaluate our method with extensive experiments on 7 pre-trained models, covering unconditional and conditional sampling in both pixel-space and latent-space DPMs. We achieve FIDs of 2.38 (10 NFE), and 2.27 (10 NFE) on unconditional CIFAR10 and AFHQv2 in 5-10 minutes of training. LD3 offers an efficient approach to sampling from pre-trained diffusion models. Code is available at https://github.com/vinhsuhi/LD3.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes