CVAINov 18, 2024

Unveiling Redundancy in Diffusion Transformers (DiTs): A Systematic Study

arXiv:2411.13588v19 citationsh-index: 4Has Code
Originality Synthesis-oriented
AI Analysis

This work tackles the problem of slow inference in DiTs for image and video generation, but it is incremental as it primarily provides analysis tools rather than a new solution.

The study investigated redundancy in Diffusion Transformers (DiTs) to address high inference latency, finding that redundancy distribution varies significantly across models but is stable within each model regardless of inputs or settings.

The increased model capacity of Diffusion Transformers (DiTs) and the demand for generating higher resolutions of images and videos have led to a significant rise in inference latency, impacting real-time performance adversely. While prior research has highlighted the presence of high similarity in activation values between adjacent diffusion steps (referred to as redundancy) and proposed various caching mechanisms to mitigate computational overhead, the exploration of redundancy in existing literature remains limited, with findings often not generalizable across different DiT models. This study aims to address this gap by conducting a comprehensive investigation into redundancy across a broad spectrum of mainstream DiT models. Our experimental analysis reveals substantial variations in the distribution of redundancy across diffusion steps among different DiT models. Interestingly, within a single model, the redundancy distribution remains stable regardless of variations in input prompts, step counts, or scheduling strategies. Given the lack of a consistent pattern across diverse models, caching strategies designed for a specific group of models may not easily transfer to others. To overcome this challenge, we introduce a tool for analyzing the redundancy of individual models, enabling subsequent research to develop tailored caching strategies for specific model architectures. The project is publicly available at https://github.com/xdit-project/DiTCacheAnalysis.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes