CVFeb 1, 2025

CAT Pruning: Cluster-Aware Token Pruning For Text-to-Image Diffusion Models

arXiv:2502.00433v116 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This addresses efficiency for users of diffusion models, but it is incremental as it builds on existing pruning and caching techniques.

The paper tackles the high computational cost of text-to-image diffusion models by proposing a token pruning method with caching, achieving a 50%-60% reduction in computational costs while maintaining model performance.

Diffusion models have revolutionized generative tasks, especially in the domain of text-to-image synthesis; however, their iterative denoising process demands substantial computational resources. In this paper, we present a novel acceleration strategy that integrates token-level pruning with caching techniques to tackle this computational challenge. By employing noise relative magnitude, we identify significant token changes across denoising iterations. Additionally, we enhance token selection by incorporating spatial clustering and ensuring distributional balance. Our experiments demonstrate reveal a 50%-60% reduction in computational costs while preserving the performance of the model, thereby markedly increasing the efficiency of diffusion models. The code is available at https://github.com/ada-cheng/CAT-Pruning

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes