IVAILGMMDec 17, 2023

Light-weight CNN-based VVC Inter Partitioning Acceleration

arXiv:2312.10567v116 citationsh-index: 41IVMSP
Originality Incremental advance
AI Analysis

This work addresses the computational bottleneck in VVC encoding for video compression applications, representing an incremental improvement.

The paper tackles the high encoder complexity of the VVC video coding standard by proposing a lightweight CNN-based method to accelerate inter partitioning, achieving 17-30% speedup with a 0.37-1.18% BD-rate increase.

The Versatile Video Coding (VVC) standard has been finalized by Joint Video Exploration Team (JVET) in 2020. Compared to the High Efficiency Video Coding (HEVC) standard, VVC offers about 50% compression efficiency gain, in terms of Bjontegaard Delta-Rate (BD-rate), at the cost of about 10x more encoder complexity. In this paper, we propose a Convolutional Neural Network (CNN)-based method to speed up inter partitioning in VVC. Our method operates at the Coding Tree Unit (CTU) level, by splitting each CTU into a fixed grid of 8x8 blocks. Then each cell in this grid is associated with information about the partitioning depth within that area. A lightweight network for predicting this grid is employed during the rate-distortion optimization to limit the Quaternary Tree (QT)-split search and avoid partitions that are unlikely to be selected. Experiments show that the proposed method can achieve acceleration ranging from 17% to 30% in the RandomAccess Group Of Picture 32 (RAGOP32) mode of VVC Test Model (VTM)10 with a reasonable efficiency drop ranging from 0.37% to 1.18% in terms of BD-rate increase.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes