IVCVMMNov 24, 2025

Neural B-Frame Coding: Tackling Domain Shift Issues with Lightweight Online Motion Resolution Adaptation

arXiv:2511.18724v1Has Code
Originality Incremental advance
AI Analysis

This work addresses a specific computational bottleneck in video compression for applications requiring efficient encoding, though it appears incremental as it builds on existing B-frame codec frameworks.

This paper tackles the domain-shift issue in learned B-frame video codecs caused by mismatched Group-of-Pictures sizes, which leads to inaccurate motion estimates, by introducing lightweight classifiers to predict optimal downsampling factors for motion resolution adaptation. The proposed methods achieve coding performance comparable to exhaustive search approaches while significantly reducing computational complexity.

Learned B-frame codecs with hierarchical temporal prediction often encounter the domain-shift issue due to mismatches between the Group-of-Pictures (GOP) sizes for training and testing, leading to inaccurate motion estimates, particularly for large motion. A common solution is to turn large motion into small motion by downsampling video frames during motion estimation. However, determining the optimal downsampling factor typically requires costly rate-distortion optimization. This work introduces lightweight classifiers to predict downsampling factors. These classifiers leverage simple state signals from current and reference frames to balance rate-distortion performance with computational cost. Three variants are proposed: (1) a binary classifier (Bi-Class) trained with Focal Loss to choose between high and low resolutions, (2) a multi-class classifier (Mu-Class) trained with novel soft labels based on rate-distortion costs, and (3) a co-class approach (Co-Class) that combines the predictive capability of the multi-class classifier with the selective search of the binary classifier. All classifier methods can work seamlessly with existing B-frame codecs without requiring codec retraining. Experimental results show that they achieve coding performance comparable to exhaustive search methods while significantly reducing computational complexity. The code is available at: https://github.com/NYCU-MAPL/Fast-OMRA.git.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes