CVLGFeb 23

Relational Feature Caching for Accelerating Diffusion Transformers

arXiv:2602.19506v11 citationsh-index: 34
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in accelerating diffusion transformers for applications like image generation, representing an incremental advancement over prior caching methods.

The paper tackles the problem of performance degradation in diffusion transformers due to prediction errors in feature caching, and proposes relational feature caching (RFC) which leverages input-output relationships to enhance feature prediction accuracy, achieving significant performance improvements across various DiT models.

Feature caching approaches accelerate diffusion transformers (DiTs) by storing the output features of computationally expensive modules at certain timesteps, and exploiting them for subsequent steps to reduce redundant computations. Recent forecasting-based caching approaches employ temporal extrapolation techniques to approximate the output features with cached ones. Although effective, relying exclusively on temporal extrapolation still suffers from significant prediction errors, leading to performance degradation. Through a detailed analysis, we find that 1) these errors stem from the irregular magnitude of changes in the output features, and 2) an input feature of a module is strongly correlated with the corresponding output. Based on this, we propose relational feature caching (RFC), a novel framework that leverages the input-output relationship to enhance the accuracy of the feature prediction. Specifically, we introduce relational feature estimation (RFE) to estimate the magnitude of changes in the output features from the inputs, enabling more accurate feature predictions. We also present relational cache scheduling (RCS), which estimates the prediction errors using the input features and performs full computations only when the errors are expected to be substantial. Extensive experiments across various DiT models demonstrate that RFC consistently outperforms prior approaches significantly. Project page is available at https://cvlab.yonsei.ac.kr/projects/RFC

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes