MMSep 24, 2021

Spatial Information Refinement for Chroma Intra Prediction in Video Coding

arXiv:2109.11913v16 citations
Originality Incremental advance
AI Analysis

This work addresses video compression efficiency for codec developers, but it is incremental as it builds on existing neural network-based methods.

The paper tackled improving chroma intra prediction in video coding by proposing spatial information refinement methods, such as refined down-sampling and incorporating location information, which resulted in BD-rate reductions of up to 3.00% on chroma components under the All-Intra configuration in VVC.

Video compression benefits from advanced chroma intra prediction methods, such as the Cross-Component Linear Model (CCLM) which uses linear models to approximate the relationship between the luma and chroma components. Recently it has been proven that advanced cross-component prediction methods based on Neural Networks (NN) can bring additional coding gains. In this paper, spatial information refinement is proposed for improving NN-based chroma intra prediction. Specifically, the performance of chroma intra prediction can be improved by refined down-sampling or by incorporating location information. Experimental results show that the two proposed methods obtain 0.31%, 2.64%, 2.02% and 0.33%, 3.00%, 2.12% BD-rate reduction on Y, Cb and Cr components, respectively, under All-Intra configuration, when implemented in Versatile Video Coding (H.266/VVC) test model. Index Terms-Chroma intra prediction, convolutional neural networks, spatial information refinement.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes