IVCVFeb 14, 2024

Extreme Video Compression with Pre-trained Diffusion Models

arXiv:2402.08934v110 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of efficient video compression for applications requiring low bandwidth, though it is incremental as it builds on existing diffusion model techniques.

The paper tackles extreme video compression by using a pre-trained diffusion model at the decoder to predict frames from neural compressed inputs, achieving visually pleasing reconstruction at bit rates as low as 0.02 bits per pixel, outperforming standard codecs like H.264 and H.265 in low bit-rate regimes.

Diffusion models have achieved remarkable success in generating high quality image and video data. More recently, they have also been used for image compression with high perceptual quality. In this paper, we present a novel approach to extreme video compression leveraging the predictive power of diffusion-based generative models at the decoder. The conditional diffusion model takes several neural compressed frames and generates subsequent frames. When the reconstruction quality drops below the desired level, new frames are encoded to restart prediction. The entire video is sequentially encoded to achieve a visually pleasing reconstruction, considering perceptual quality metrics such as the learned perceptual image patch similarity (LPIPS) and the Frechet video distance (FVD), at bit rates as low as 0.02 bits per pixel (bpp). Experimental results demonstrate the effectiveness of the proposed scheme compared to standard codecs such as H.264 and H.265 in the low bpp regime. The results showcase the potential of exploiting the temporal relations in video data using generative models. Code is available at: https://github.com/ElesionKyrie/Extreme-Video-Compression-With-Prediction-Using-Pre-trainded-Diffusion-Models-

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes