LGAIMLFeb 1, 2022

Progressive Distillation for Fast Sampling of Diffusion Models

arXiv:2202.00512v2349 citations
AI Analysis

This addresses a key bottleneck for practitioners using diffusion models in image generation by enabling faster sampling without significant quality loss, though it is incremental as it builds on existing diffusion frameworks.

The paper tackles the slow sampling time of diffusion models by introducing new parameterizations for stability and a progressive distillation method that reduces sampling steps from thousands to as few as 4, achieving a FID of 3.0 on CIFAR-10 in 4 steps.

Diffusion models have recently shown great promise for generative modeling, outperforming GANs on perceptual quality and autoregressive models at density estimation. A remaining downside is their slow sampling time: generating high quality samples takes many hundreds or thousands of model evaluations. Here we make two contributions to help eliminate this downside: First, we present new parameterizations of diffusion models that provide increased stability when using few sampling steps. Second, we present a method to distill a trained deterministic diffusion sampler, using many steps, into a new diffusion model that takes half as many sampling steps. We then keep progressively applying this distillation procedure to our model, halving the number of required sampling steps each time. On standard image generation benchmarks like CIFAR-10, ImageNet, and LSUN, we start out with state-of-the-art samplers taking as many as 8192 steps, and are able to distill down to models taking as few as 4 steps without losing much perceptual quality; achieving, for example, a FID of 3.0 on CIFAR-10 in 4 steps. Finally, we show that the full progressive distillation procedure does not take more time than it takes to train the original model, thus representing an efficient solution for generative modeling using diffusion at both train and test time.

Code Implementations12 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes