IVLGMMNov 22, 2019

Dual Learning-based Video Coding with Inception Dense Blocks

arXiv:1911.09857v11 citations
Originality Incremental advance
AI Analysis

This addresses video coding efficiency for compression applications, representing an incremental improvement by combining existing network types in a novel way.

The paper tackles video compression by introducing a dual learning-based method combining intra prediction and reconstruction filtering networks, achieving state-of-the-art performance with average bitrate reductions of 10.24% for all-intra and 3.57% for random-access coding compared to a baseline.

In this paper, a dual learning-based method in intra coding is introduced for PCS Grand Challenge. This method is mainly composed of two parts: intra prediction and reconstruction filtering. They use different network structures, the neural network-based intra prediction uses the full-connected network to predict the block while the neural network-based reconstruction filtering utilizes the convolutional networks. Different with the previous filtering works, we use a network with more powerful feature extraction capabilities in our reconstruction filtering network. And the filtering unit is the block-level so as to achieve a more accurate filtering compensation. To our best knowledge, among all the learning-based methods, this is the first attempt to combine two different networks in one application, and we achieve the state-of-the-art performance for AI configuration on the HEVC Test sequences. The experimental result shows that our method leads to significant BD-rate saving for provided 8 sequences compared to HM-16.20 baseline (average 10.24% and 3.57% bitrate reductions for all-intra and random-access coding, respectively). For HEVC test sequences, our model also achieved a 9.70% BD-rate saving compared to HM-16.20 baseline for all-intra configuration.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes