CVLGJun 18, 2025

Modulated Diffusion: Accelerating Generative Modeling with Modulated Quantization

arXiv:2506.22463v13 citationsh-index: 5Has CodeICML
Originality Incremental advance
AI Analysis

This work addresses the computational bottleneck in diffusion models for generative modeling, offering an incremental improvement over existing acceleration techniques.

The paper tackles the high computation cost in diffusion models by introducing Modulated Diffusion (MoDiff), a framework using modulated quantization and error compensation, which reduces activation quantization from 8 bits to 3 bits without performance degradation on datasets like CIFAR-10 and LSUN.

Diffusion models have emerged as powerful generative models, but their high computation cost in iterative sampling remains a significant bottleneck. In this work, we present an in-depth and insightful study of state-of-the-art acceleration techniques for diffusion models, including caching and quantization, revealing their limitations in computation error and generation quality. To break these limits, this work introduces Modulated Diffusion (MoDiff), an innovative, rigorous, and principled framework that accelerates generative modeling through modulated quantization and error compensation. MoDiff not only inherents the advantages of existing caching and quantization methods but also serves as a general framework to accelerate all diffusion models. The advantages of MoDiff are supported by solid theoretical insight and analysis. In addition, extensive experiments on CIFAR-10 and LSUN demonstrate that MoDiff significant reduces activation quantization from 8 bits to 3 bits without performance degradation in post-training quantization (PTQ). Our code implementation is available at https://github.com/WeizhiGao/MoDiff.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes