LGAIApr 2, 2025

Random Conditioning with Distillation for Data-Efficient Diffusion Model Compression

arXiv:2504.02011v12 citationsh-index: 1CVPR
Originality Highly original
AI Analysis

This work addresses the problem of resource-efficient deployment of generative diffusion models for researchers and real-world applications, representing a novel method for a known bottleneck.

The paper tackles the computational intensity of diffusion models by proposing Random Conditioning, a novel image-free knowledge distillation method that enables student models to generate concepts unseen in training images, resulting in improved generation quality and efficiency.

Diffusion models generate high-quality images through progressive denoising but are computationally intensive due to large model sizes and repeated sampling. Knowledge distillation, which transfers knowledge from a complex teacher to a simpler student model, has been widely studied in recognition tasks, particularly for transferring concepts unseen during student training. However, its application to diffusion models remains underexplored, especially in enabling student models to generate concepts not covered by the training images. In this work, we propose Random Conditioning, a novel approach that pairs noised images with randomly selected text conditions to enable efficient, image-free knowledge distillation. By leveraging this technique, we show that the student can generate concepts unseen in the training images. When applied to conditional diffusion model distillation, our method allows the student to explore the condition space without generating condition-specific images, resulting in notable improvements in both generation quality and efficiency. This promotes resource-efficient deployment of generative diffusion models, broadening their accessibility for both research and real-world applications. Code, models, and datasets are available at https://dohyun-as.github.io/Random-Conditioning .

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes