CVAIMMJan 15

Towards Efficient Low-rate Image Compression with Frequency-aware Diffusion Prior Refinement

arXiv:2601.10373v11 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses efficiency and fidelity issues in low-rate image compression for applications requiring fast and high-quality reconstruction, representing a strong incremental improvement over existing methods.

The paper tackles the problem of slow sampling and suboptimal bit allocation in diffusion-based image compression at low bit rates, achieving a 27.2% BD-rate saving in LPIPS, 65.1% in PNR, and over 10x speed-up compared to state-of-the-art baselines.

Recent advancements in diffusion-based generative priors have enabled visually plausible image compression at extremely low bit rates. However, existing approaches suffer from slow sampling processes and suboptimal bit allocation due to fragmented training paradigms. In this work, we propose Accelerate \textbf{Diff}usion-based Image Compression via \textbf{C}onsistency Prior \textbf{R}efinement (DiffCR), a novel compression framework for efficient and high-fidelity image reconstruction. At the heart of DiffCR is a Frequency-aware Skip Estimation (FaSE) module that refines the $ε$-prediction prior from a pre-trained latent diffusion model and aligns it with compressed latents at different timesteps via Frequency Decoupling Attention (FDA). Furthermore, a lightweight consistency estimator enables fast \textbf{two-step decoding} by preserving the semantic trajectory of diffusion sampling. Without updating the backbone diffusion model, DiffCR achieves substantial bitrate savings (27.2\% BD-rate (LPIPS) and 65.1\% BD-rate (PSNR)) and over $10\times$ speed-up compared to SOTA diffusion-based compression baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes