CVGRMar 30, 2023

Implicit View-Time Interpolation of Stereo Videos using Multi-Plane Disparities and Non-Uniform Coordinates

arXiv:2303.17181v12 citationsh-index: 27
Originality Incremental advance
AI Analysis

This work improves stereo video interpolation for applications like VR and film, but it is incremental as it builds on X-Fields with specific enhancements.

The paper tackles view-time interpolation of stereo videos by addressing X-Fields' limitations with large baseline cameras and non-linear motion, using multi-plane disparities and non-uniform time coordinates. It achieves better results than state-of-the-art methods with near real-time performance and low resource costs.

In this paper, we propose an approach for view-time interpolation of stereo videos. Specifically, we build upon X-Fields that approximates an interpolatable mapping between the input coordinates and 2D RGB images using a convolutional decoder. Our main contribution is to analyze and identify the sources of the problems with using X-Fields in our application and propose novel techniques to overcome these challenges. Specifically, we observe that X-Fields struggles to implicitly interpolate the disparities for large baseline cameras. Therefore, we propose multi-plane disparities to reduce the spatial distance of the objects in the stereo views. Moreover, we propose non-uniform time coordinates to handle the non-linear and sudden motion spikes in videos. We additionally introduce several simple, but important, improvements over X-Fields. We demonstrate that our approach is able to produce better results than the state of the art, while running in near real-time rates and having low memory and storage costs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes