IVCVLGOct 2, 2023

MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device

arXiv:2310.01258v320 citationsh-index: 19
Originality Incremental advance
AI Analysis

This work addresses the problem of efficient neural video compression for mobile applications, representing an incremental improvement over prior mobile-focused neural codecs.

The paper tackles the challenge of deploying neural video codecs on mobile devices by presenting MobileNVC, the first codec that decodes 1080p YUV420 video in real time on a mobile device, achieving up to 48% BD-rate savings and reducing MAC count by 10x compared to previous on-device codecs.

Neural video codecs have recently become competitive with standard codecs such as HEVC in the low-delay setting. However, most neural codecs are large floating-point networks that use pixel-dense warping operations for temporal modeling, making them too computationally expensive for deployment on mobile devices. Recent work has demonstrated that running a neural decoder in real time on mobile is feasible, but shows this only for 720p RGB video. This work presents the first neural video codec that decodes 1080p YUV420 video in real time on a mobile device. Our codec relies on two major contributions. First, we design an efficient codec that uses a block-based motion compensation algorithm available on the warping core of the mobile accelerator, and we show how to quantize this model to integer precision. Second, we implement a fast decoder pipeline that concurrently runs neural network components on the neural signal processor, parallel entropy coding on the mobile GPU, and warping on the warping core. Our codec outperforms the previous on-device codec by a large margin with up to 48% BD-rate savings, while reducing the MAC count on the receiver side by $10 \times$. We perform a careful ablation to demonstrate the effect of the introduced motion compensation scheme, and ablate the effect of model quantization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes