CVSep 4, 2025

LMVC: An End-to-End Learned Multiview Video Coding Framework

arXiv:2509.03922v1h-index: 1
Originality Incremental advance
AI Analysis

This addresses the problem of efficient compression for multiview video, which is crucial for applications like volumetric video, though it is incremental as it builds on existing deep learning-based video coding methods.

The paper tackles the challenge of compressing multiview video for storage and transmission by proposing an end-to-end learned framework that leverages inter-view motion and content correlations, achieving a large margin of improvement over the traditional MV-HEVC standard.

Multiview video is a key data source for volumetric video, enabling immersive 3D scene reconstruction but posing significant challenges in storage and transmission due to its massive data volume. Recently, deep learning-based end-to-end video coding has achieved great success, yet most focus on single-view or stereo videos, leaving general multiview scenarios underexplored. This paper proposes an end-to-end learned multiview video coding (LMVC) framework that ensures random access and backward compatibility while enhancing compression efficiency. Our key innovation lies in effectively leveraging independent-view motion and content information to enhance dependent-view compression. Specifically, to exploit the inter-view motion correlation, we propose a feature-based inter-view motion vector prediction method that conditions dependent-view motion encoding on decoded independent-view motion features, along with an inter-view motion entropy model that learns inter-view motion priors. To exploit the inter-view content correlation, we propose a disparity-free inter-view context prediction module that predicts inter-view contexts from decoded independent-view content features, combined with an inter-view contextual entropy model that captures inter-view context priors. Experimental results show that our proposed LMVC framework outperforms the reference software of the traditional MV-HEVC standard by a large margin, establishing a strong baseline for future research in this field.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes