IVCVJul 17, 2024

High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion

arXiv:2407.12538v36 citationsh-index: 20Has Code
Originality Incremental advance
AI Analysis

This work addresses image compression for applications requiring high detail fidelity, representing an incremental improvement by combining wavelet transforms with diffusion models.

The paper tackles the challenge of balancing high perceptual quality and low distortion in image compression using diffusion models by proposing UGDiff, which focuses on high-frequency compression via wavelet transform and an uncertainty-guided approach, achieving state-of-the-art performance in rate-distortion, perceptual quality, and inference time on benchmark datasets.

Diffusion probabilistic models have recently achieved remarkable success in generating high-quality images. However, balancing high perceptual quality and low distortion remains challenging in application of diffusion models in image compression. To address this issue, we propose a novel Uncertainty-Guided image compression approach with wavelet Diffusion (UGDiff). Our approach focuses on high frequency compression via the wavelet transform, since high frequency components are crucial for reconstructing image details. We introduce a wavelet conditional diffusion model for high frequency prediction, followed by a residual codec that compresses and transmits prediction residuals to the decoder. This diffusion prediction-then-residual compression paradigm effectively addresses the low fidelity issue common in direct reconstructions by existing diffusion models. Considering the uncertainty from the random sampling of the diffusion model, we further design an uncertainty-weighted rate-distortion (R-D) loss tailored for residual compression, providing a more rational trade-off between rate and distortion. Comprehensive experiments on two benchmark datasets validate the effectiveness of UGDiff, surpassing state-of-the-art image compression methods in R-D performance, perceptual quality, subjective quality, and inference time. Our code is available at: https://github.com/hejiaxiang1/Wavelet-Diffusion/tree/main.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes