CV · May 28, 2025

Re-ttention: Ultra Sparse Visual Generation via Attention Statistical Reshape

arXiv:2505.22918v4 · 6 citations · h-index: 8
Originality: Incremental advance
AI Analysis

This work addresses efficiency bottlenecks for researchers and practitioners who use high-resolution video and image generation models. It is rated incremental because it builds on existing sparse attention methods.

The paper tackles the computational bottleneck of quadratic attention in Diffusion Transformers for visual generation by proposing Re-ttention, which uses temporal redundancy to enable ultra-sparse attention with as few as 3.1% of tokens while preserving visual quality.

Diffusion Transformers (DiT) have become the de facto model for generating high-quality visual content like videos and images. A major bottleneck is the attention mechanism, whose complexity scales quadratically with resolution and video length. One logical way to lessen this burden is sparse attention, where only a subset of tokens or patches is included in the calculation. However, existing techniques fail to preserve visual quality at extremely high sparsity levels and may even incur non-negligible compute overheads. To address this concern, we propose Re-ttention, which implements very high-sparsity attention for visual generation models by leveraging the temporal redundancy of Diffusion Models to overcome the probabilistic normalization shift within the attention mechanism. Specifically, Re-ttention reshapes attention scores based on the prior softmax distribution history in order to preserve the visual quality of the full quadratic attention at very high sparsity levels. Experimental results on T2V/T2I models such as CogVideoX and the PixArt DiTs demonstrate that Re-ttention requires as few as 3.1% of the tokens during inference, outperforming contemporary methods like FastDiTAttn, Sparse VideoGen and MInference.
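
The normalization shift mentioned in the abstract can be illustrated with a toy example. The sketch below is not the paper's algorithm: it assumes, purely for illustration, that the fraction of softmax mass held by the kept keys is cached from a previous denoising step where full attention was computed, and then reused to rescale the sparse weights at the current step. All names (`denominator`, `attention_weights`, `rho`) and the toy scores are hypothetical.

```python
import math

def denominator(scores, idx):
    """Softmax denominator restricted to the key indices in `idx`."""
    return sum(math.exp(scores[i]) for i in idx)

def attention_weights(scores, idx, rho=1.0):
    """Softmax over the kept keys only, optionally rescaled by `rho`."""
    z = denominator(scores, idx)
    return {i: rho * math.exp(scores[i]) / z for i in idx}

# Toy attention scores for one query against 8 keys at two adjacent
# denoising steps (made-up numbers; adjacent steps are assumed similar).
prev_scores = [2.0, 1.5, 0.3, 0.1, -0.2, 0.0, 0.4, -0.5]
curr_scores = [2.1, 1.4, 0.2, 0.2, -0.1, 0.1, 0.3, -0.4]
all_keys = list(range(8))
kept = [0, 1]  # sparse attention keeps only these keys

full = attention_weights(curr_scores, all_keys)  # reference weights
sparse = attention_weights(curr_scores, kept)    # naive sparse softmax

# Assumption: at the previous step full attention was available, so the
# share of the softmax mass held by the kept keys can be cached.
rho = denominator(prev_scores, kept) / denominator(prev_scores, all_keys)
rescaled = attention_weights(curr_scores, kept, rho)

err_sparse = max(abs(sparse[i] - full[i]) for i in kept)
err_rescaled = max(abs(rescaled[i] - full[i]) for i in kept)
print(f"naive sparse error:    {err_sparse:.4f}")
print(f"rescaled sparse error: {err_rescaled:.4f}")
```

The naive sparse weights sum to 1 over the kept keys alone, so each weight is inflated relative to full attention. Rescaling by the cached ratio brings them back close to the full-attention values as long as the scores change little between steps, which is the kind of temporal redundancy the abstract refers to.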

Code Implementations: 1 repo