CVNov 7, 2022

A Unified Pyramid Recurrent Network for Video Frame Interpolation

arXiv:2211.03456v261 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This work addresses frame interpolation for video processing applications, presenting an incremental improvement in efficiency and robustness.

The paper tackles video frame interpolation by proposing UPR-Net, a lightweight unified pyramid recurrent network that improves robustness for large motions, achieving excellent performance with only 1.7M parameters across multiple benchmarks.

Flow-guided synthesis provides a common framework for frame interpolation, where optical flow is estimated to guide the synthesis of intermediate frames between consecutive inputs. In this paper, we present UPR-Net, a novel Unified Pyramid Recurrent Network for frame interpolation. Cast in a flexible pyramid framework, UPR-Net exploits lightweight recurrent modules for both bi-directional flow estimation and intermediate frame synthesis. At each pyramid level, it leverages estimated bi-directional flow to generate forward-warped representations for frame synthesis; across pyramid levels, it enables iterative refinement for both optical flow and intermediate frame. In particular, we show that our iterative synthesis strategy can significantly improve the robustness of frame interpolation on large motion cases. Despite being extremely lightweight (1.7M parameters), our base version of UPR-Net achieves excellent performance on a large range of benchmarks. Code and trained models of our UPR-Net series are available at: https://github.com/srcn-ivl/UPR-Net.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes