CVLGIVJul 12, 2022

CANF-VC: Conditional Augmented Normalizing Flows for Video Compression

Peking U
arXiv:2207.05315v392 citationsh-index: 21Has Code
Originality Highly original
AI Analysis

This work addresses video compression for applications requiring efficient storage and transmission, representing a novel approach rather than an incremental improvement.

The paper tackles video compression by proposing CANF-VC, an end-to-end learning-based system using conditional augmented normalizing flows for conditional inter-frame and motion coding, which achieves superiority over state-of-the-art methods on common datasets.

This paper presents an end-to-end learning-based video compression system, termed CANF-VC, based on conditional augmented normalizing flows (CANF). Most learned video compression systems adopt the same hybrid-based coding architecture as the traditional codecs. Recent research on conditional coding has shown the sub-optimality of the hybrid-based coding and opens up opportunities for deep generative models to take a key role in creating new coding frameworks. CANF-VC represents a new attempt that leverages the conditional ANF to learn a video generative model for conditional inter-frame coding. We choose ANF because it is a special type of generative model, which includes variational autoencoder as a special case and is able to achieve better expressiveness. CANF-VC also extends the idea of conditional coding to motion coding, forming a purely conditional coding framework. Extensive experimental results on commonly used datasets confirm the superiority of CANF-VC to the state-of-the-art methods. The source code of CANF-VC is available at https://github.com/NYCU-MAPL/CANF-VC.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes