IVAICVAug 4, 2021

Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction

arXiv:2108.02110v259 citationsHas Code
AI Analysis

This work addresses video quality enhancement for compressed videos, offering incremental improvements over existing deep learning methods.

The paper tackles video compression artifact reduction by proposing a Recursive Fusion module for long-range temporal dependency and a Deformable Spatiotemporal Attention module to focus on artifact-rich areas, achieving state-of-the-art performance on the MFQE 2.0 dataset in fidelity and perceptual metrics.

A number of deep learning based algorithms have been proposed to recover high-quality videos from low-quality compressed ones. Among them, some restore the missing details of each frame via exploring the spatiotemporal information of neighboring frames. However, these methods usually suffer from a narrow temporal scope, thus may miss some useful details from some frames outside the neighboring ones. In this paper, to boost artifact removal, on the one hand, we propose a Recursive Fusion (RF) module to model the temporal dependency within a long temporal range. Specifically, RF utilizes both the current reference frames and the preceding hidden state to conduct better spatiotemporal compensation. On the other hand, we design an efficient and effective Deformable Spatiotemporal Attention (DSTA) module such that the model can pay more effort on restoring the artifact-rich areas like the boundary area of a moving object. Extensive experiments show that our method outperforms the existing ones on the MFQE 2.0 dataset in terms of both fidelity and perceptual effect. Code is available at https://github.com/zhaominyiz/RFDA-PyTorch.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes