IVCVSep 24, 2019

Deformable Non-local Network for Video Super-Resolution

arXiv:1909.10692v277 citations
Originality Incremental advance
AI Analysis

This addresses the problem of inaccurate optical flow in video super-resolution for applications like video enhancement, though it is an incremental improvement over existing deep learning methods.

The paper tackles video super-resolution by proposing a deformable non-local network (DNLN) that avoids optical flow alignment, achieving state-of-the-art performance on benchmark datasets.

The video super-resolution (VSR) task aims to restore a high-resolution (HR) video frame by using its corresponding low-resolution (LR) frame and multiple neighboring frames. At present, many deep learning-based VSR methods rely on optical flow to perform frame alignment. The final recovery results will be greatly affected by the accuracy of optical flow. However, optical flow estimation cannot be completely accurate, and there are always some errors. In this paper, we propose a novel deformable non-local network (DNLN) which is a non-optical-flow-based method. Specifically, we apply the deformable convolution and improve its ability of adaptive alignment at the feature level. Furthermore, we utilize a non-local structure to capture the global correlation between the reference frame and the aligned neighboring frames, and simultaneously enhance desired fine details in the aligned frames. To reconstruct the final high-quality HR video frames, we use residual in residual dense blocks to take full advantage of the hierarchical features. Experimental results on benchmark datasets demonstrate that the proposed DNLN can achieve state-of-the-art performance on VSR task.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes