CVApr 9, 2024

Space-Time Video Super-resolution with Neural Operator

arXiv:2404.06036v14 citationsh-index: 16IEEE Transactions on Image Processing
Originality Incremental advance
AI Analysis

This work improves video quality for applications like surveillance or entertainment by enhancing resolution in space and time, though it is incremental as it builds on existing neural network approaches.

The paper tackles the problem of space-time video super-resolution by addressing inaccurate motion estimation and compensation for large motions, achieving state-of-the-art results in both fixed-size and continuous tasks.

This paper addresses the task of space-time video super-resolution (ST-VSR). Existing methods generally suffer from inaccurate motion estimation and motion compensation (MEMC) problems for large motions. Inspired by recent progress in physics-informed neural networks, we model the challenges of MEMC in ST-VSR as a mapping between two continuous function spaces. Specifically, our approach transforms independent low-resolution representations in the coarse-grained continuous function space into refined representations with enriched spatiotemporal details in the fine-grained continuous function space. To achieve efficient and accurate MEMC, we design a Galerkin-type attention function to perform frame alignment and temporal interpolation. Due to the linear complexity of the Galerkin-type attention mechanism, our model avoids patch partitioning and offers global receptive fields, enabling precise estimation of large motions. The experimental results show that the proposed method surpasses state-of-the-art techniques in both fixed-size and continuous space-time video super-resolution tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes