CVNov 28, 2023

Adversarial Diffusion Distillation

Axel Sauer, Dominik Lorenz, Andreas Blattmann, Robin Rombach

arXiv:2311.17042v151.1801 citationsh-index: 20Has Code

Originality Highly original

AI Analysis

This addresses the challenge of real-time image synthesis for applications requiring fast generation, representing a significant advance rather than an incremental improvement.

The paper tackles the problem of slow sampling in large-scale image diffusion models by introducing Adversarial Diffusion Distillation (ADD), which enables efficient sampling in 1-4 steps while maintaining high image quality, outperforming existing few-step methods in a single step and matching state-of-the-art diffusion models in four steps.

We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that efficiently samples large-scale foundational image diffusion models in just 1-4 steps while maintaining high image quality. We use score distillation to leverage large-scale off-the-shelf image diffusion models as a teacher signal in combination with an adversarial loss to ensure high image fidelity even in the low-step regime of one or two sampling steps. Our analyses show that our model clearly outperforms existing few-step methods (GANs, Latent Consistency Models) in a single step and reaches the performance of state-of-the-art diffusion models (SDXL) in only four steps. ADD is the first method to unlock single-step, real-time image synthesis with foundation models. Code and weights available under https://github.com/Stability-AI/generative-models and https://huggingface.co/stabilityai/ .

View on arXiv PDF Code

Similar