CVAINov 7, 2023

Reducing Spatial Fitting Error in Distillation of Denoising Diffusion Models

arXiv:2311.03830v21 citationsh-index: 6Has Code
Originality Incremental advance
AI Analysis

This addresses the computational inefficiency of diffusion models for practitioners by enabling high-quality image generation with fewer steps, though it is incremental as it builds on existing distillation methods.

The paper tackles the problem of generative quality degradation in knowledge distillation for denoising diffusion models, which reduces sampling iterations, and achieves an FID of 5.31 on CIFAR-10 and 9.39 on ImageNet 64x64 with only one step.

Denoising Diffusion models have exhibited remarkable capabilities in image generation. However, generating high-quality samples requires a large number of iterations. Knowledge distillation for diffusion models is an effective method to address this limitation with a shortened sampling process but causes degraded generative quality. Based on our analysis with bias-variance decomposition and experimental observations, we attribute the degradation to the spatial fitting error occurring in the training of both the teacher and student model. Accordingly, we propose $\textbf{S}$patial $\textbf{F}$itting-$\textbf{E}$rror $\textbf{R}$eduction $\textbf{D}$istillation model ($\textbf{SFERD}$). SFERD utilizes attention guidance from the teacher model and a designed semantic gradient predictor to reduce the student's fitting error. Empirically, our proposed model facilitates high-quality sample generation in a few function evaluations. We achieve an FID of 5.31 on CIFAR-10 and 9.39 on ImageNet 64$\times$64 with only one step, outperforming existing diffusion methods. Our study provides a new perspective on diffusion distillation by highlighting the intrinsic denoising ability of models. Project link: \url{https://github.com/Sainzerjj/SFERD}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes