IVCVNov 26, 2024

Motion Free B-frame Coding for Neural Video Compression

arXiv:2411.17160v11 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of computational overhead and blur artifacts in neural video compression for applications requiring efficient video encoding, representing an incremental improvement over existing hybrid methods.

The paper tackles the inefficiency and high computational cost of motion-based modules in neural video compression by proposing a kernel-based motion-free approach, which improves coding efficiency, reduces complexity, and achieves competitive performance on datasets like HEVC-class B, UVG, and MCL-JCV.

Typical deep neural video compression networks usually follow the hybrid approach of classical video coding that contains two separate modules: motion coding and residual coding. In addition, a symmetric auto-encoder is often used as a normal architecture for both motion and residual coding. In this paper, we propose a novel approach that handles the drawbacks of the two typical above-mentioned architectures, we call it kernel-based motion-free video coding. The advantages of the motion-free approach are twofold: it improves the coding efficiency of the network and significantly reduces computational complexity thanks to eliminating motion estimation, motion compensation, and motion coding which are the most time-consuming engines. In addition, the kernel-based auto-encoder alleviates blur artifacts that usually occur with the conventional symmetric autoencoder. Consequently, it improves the visual quality of the reconstructed frames. Experimental results show the proposed framework outperforms the SOTA deep neural video compression networks on the HEVC-class B dataset and is competitive on the UVG and MCL-JCV datasets. In addition, it generates high-quality reconstructed frames in comparison with conventional motion coding-based symmetric auto-encoder meanwhile its model size is much smaller than that of the motion-based networks around three to four times.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes