IVCVJun 27, 2022

Flexible-Rate Learned Hierarchical Bi-Directional Video Compression With Motion Refinement and Frame-Level Bit Allocation

arXiv:2206.13613v118 citationsh-index: 62
Originality Incremental advance
AI Analysis

This work advances learned video compression, which is important for efficient video storage and streaming, though it builds incrementally on existing hierarchical bi-directional approaches.

The paper tackles video compression by improving motion refinement and enabling flexible bit allocation across frames, achieving state-of-the-art rate-distortion performance that exceeds all prior learned video coding methods.

This paper presents improvements and novel additions to our recent work on end-to-end optimized hierarchical bi-directional video compression to further advance the state-of-the-art in learned video compression. As an improvement, we combine motion estimation and prediction modules and compress refined residual motion vectors for improved rate-distortion performance. As novel addition, we adapted the gain unit proposed for image compression to flexible-rate video compression in two ways: first, the gain unit enables a single encoder model to operate at multiple rate-distortion operating points; second, we exploit the gain unit to control bit allocation among intra-coded vs. bi-directionally coded frames by fine tuning corresponding models for truly flexible-rate learned video coding. Experimental results demonstrate that we obtain state-of-the-art rate-distortion performance exceeding those of all prior art in learned video coding.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes