CVAug 5, 2017

Video Frame Interpolation via Adaptive Separable Convolution

arXiv:1708.01692v1771 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of efficient and high-quality video frame interpolation for video processing applications, offering a practical solution with reduced computational overhead.

The paper tackles video frame interpolation by proposing a method that uses adaptive separable convolution with 1D kernels to reduce memory demands and enable end-to-end training with perceptual loss, achieving high-quality results as shown in qualitative and quantitative experiments.

Standard video frame interpolation methods first estimate optical flow between input frames and then synthesize an intermediate frame guided by motion. Recent approaches merge these two steps into a single convolution process by convolving input frames with spatially adaptive kernels that account for motion and re-sampling simultaneously. These methods require large kernels to handle large motion, which limits the number of pixels whose kernels can be estimated at once due to the large memory demand. To address this problem, this paper formulates frame interpolation as local separable convolution over input frames using pairs of 1D kernels. Compared to regular 2D kernels, the 1D kernels require significantly fewer parameters to be estimated. Our method develops a deep fully convolutional neural network that takes two input frames and estimates pairs of 1D kernels for all pixels simultaneously. Since our method is able to estimate kernels and synthesizes the whole video frame at once, it allows for the incorporation of perceptual loss to train the neural network to produce visually pleasing frames. This deep neural network is trained end-to-end using widely available video data without any human annotation. Both qualitative and quantitative experiments show that our method provides a practical solution to high-quality video frame interpolation.

Code Implementations6 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes