IV CVFeb 3, 2022

Fast Online Video Super-Resolution with Deformable Attention Pyramid

Dario Fuoli, Martin Danelljan, Radu Timofte, Luc Van Gool

arXiv:2202.01731v24.81 citations

Originality Incremental advance

AI Analysis

This work addresses real-time video super-resolution for applications like video streaming and TV, offering an incremental improvement in efficiency.

The paper tackled the problem of video super-resolution under causal, real-time constraints by proposing a recurrent architecture with a deformable attention pyramid, achieving a speed-up of over 3× while surpassing state-of-the-art performance on benchmarks.

Video super-resolution (VSR) has many applications that pose strict causal, real-time, and latency constraints, including video streaming and TV. We address the VSR problem under these settings, which poses additional important challenges since information from future frames is unavailable. Importantly, designing efficient, yet effective frame alignment and fusion modules remain central problems. In this work, we propose a recurrent VSR architecture based on a deformable attention pyramid (DAP). Our DAP aligns and integrates information from the recurrent state into the current frame prediction. To circumvent the computational cost of traditional attention-based methods, we only attend to a limited number of spatial locations, which are dynamically predicted by the DAP. Comprehensive experiments and analysis of the proposed key innovations show the effectiveness of our approach. We significantly reduce processing time and computational complexity in comparison to state-of-the-art methods, while maintaining a high performance. We surpass state-of-the-art method EDVR-M on two standard benchmarks with a speed-up of over $3\times$.

View on arXiv PDF

Similar