CVApr 5, 2023

BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation

arXiv:2304.02225v138 citationsh-index: 23Has Code
Originality Incremental advance
AI Analysis

This addresses high-resolution video interpolation for applications like video editing or streaming, but appears incremental as it builds on existing motion estimation and refinement techniques.

The paper tackles 4K video frame interpolation by proposing BiFormer, a transformer-based bilateral motion estimator, and achieves excellent performance on 4K datasets.

A novel 4K video frame interpolator based on bilateral transformer (BiFormer) is proposed in this paper, which performs three steps: global motion estimation, local motion refinement, and frame synthesis. First, in global motion estimation, we predict symmetric bilateral motion fields at a coarse scale. To this end, we propose BiFormer, the first transformer-based bilateral motion estimator. Second, we refine the global motion fields efficiently using blockwise bilateral cost volumes (BBCVs). Third, we warp the input frames using the refined motion fields and blend them to synthesize an intermediate frame. Extensive experiments demonstrate that the proposed BiFormer algorithm achieves excellent interpolation performance on 4K datasets. The source codes are available at https://github.com/JunHeum/BiFormer.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes