CVJul 23, 2020

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

arXiv:2007.11803v1181 citations
Originality Incremental advance
AI Analysis

This addresses limitations in existing video super-resolution methods for applications like video enhancement, though it appears incremental by building on prior work with new modules.

The paper tackles video super-resolution by proposing a multi-correspondence aggregation network (MuCAN) that leverages similar patches across frames and self-similarity across scales, achieving state-of-the-art results on multiple benchmark datasets.

Video super-resolution (VSR) aims to utilize multiple low-resolution frames to generate a high-resolution prediction for each frame. In this process, inter- and intra-frames are the key sources for exploiting temporal and spatial information. However, there are a couple of limitations for existing VSR methods. First, optical flow is often used to establish temporal correspondence. But flow estimation itself is error-prone and affects recovery results. Second, similar patterns existing in natural images are rarely exploited for the VSR task. Motivated by these findings, we propose a temporal multi-correspondence aggregation strategy to leverage similar patches across frames, and a cross-scale nonlocal-correspondence aggregation scheme to explore self-similarity of images across scales. Based on these two new modules, we build an effective multi-correspondence aggregation network (MuCAN) for VSR. Our method achieves state-of-the-art results on multiple benchmark datasets. Extensive experiments justify the effectiveness of our method.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes