CVJun 14, 2024

LAVIB: A Large-scale Video Interpolation Benchmark

arXiv:2406.09754v211 citations
AI Analysis

This provides a standardized benchmark for researchers in low-level video processing, though it is incremental as it focuses on dataset creation rather than new methods.

The paper tackles the lack of large-scale datasets for video frame interpolation by introducing LAVIB, a benchmark with 283K clips from 17K ultra-HD videos totaling 77.6 hours, featuring metrics for motion, luminance, sharpness, and contrast, and including out-of-distribution splits.

This paper introduces a LArge-scale Video Interpolation Benchmark (LAVIB) for the low-level video task of Video Frame Interpolation (VFI). LAVIB comprises a large collection of high-resolution videos sourced from the web through an automated pipeline with minimal requirements for human verification. Metrics are computed for each video's motion magnitudes, luminance conditions, frame sharpness, and contrast. The collection of videos and the creation of quantitative challenges based on these metrics are under-explored by current low-level video task datasets. In total, LAVIB includes 283K clips from 17K ultra-HD videos, covering 77.6 hours. Benchmark train, val, and test sets maintain similar video metric distributions. Further splits are also created for out-of-distribution (OOD) challenges, with train and test splits including videos of dissimilar attributes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes