CVJul 3, 2025

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching

arXiv:2507.02860v126 citationsh-index: 21Has Code
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in video generation for researchers and practitioners by providing an efficient, accessible solution that accelerates inference without retraining.

The paper tackles the slow inference speeds and high computational costs of video diffusion models by proposing EasyCache, a training-free acceleration framework that uses runtime-adaptive caching to reuse computed transformation vectors, achieving up to 2.1-3.3x faster inference and up to 36% PSNR improvement over previous methods.

Video generation models have demonstrated remarkable performance, yet their broader adoption remains constrained by slow inference speeds and substantial computational costs, primarily due to the iterative nature of the denoising process. Addressing this bottleneck is essential for democratizing advanced video synthesis technologies and enabling their integration into real-world applications. This work proposes EasyCache, a training-free acceleration framework for video diffusion models. EasyCache introduces a lightweight, runtime-adaptive caching mechanism that dynamically reuses previously computed transformation vectors, avoiding redundant computations during inference. Unlike prior approaches, EasyCache requires no offline profiling, pre-computation, or extensive parameter tuning. We conduct comprehensive studies on various large-scale video generation models, including OpenSora, Wan2.1, and HunyuanVideo. Our method achieves leading acceleration performance, reducing inference time by up to 2.1-3.3$\times$ compared to the original baselines while maintaining high visual fidelity with a significant up to 36% PSNR improvement compared to the previous SOTA method. This improvement makes our EasyCache a efficient and highly accessible solution for high-quality video generation in both research and practical applications. The code is available at https://github.com/H-EmbodVis/EasyCache.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes