CVJun 3, 2018

ProFlow: Learning to Predict Optical Flow

arXiv:1806.00800v143 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of leveraging temporal coherence for more accurate optical flow, particularly in non-rigid motion scenarios, with incremental improvements over existing methods.

The paper tackles the problem of optical flow estimation by proposing an unsupervised online learning approach using a CNN to predict motion models per frame, achieving up to 27% improvement on benchmarks like KITTI and MPI Sintel.

Temporal coherence is a valuable source of information in the context of optical flow estimation. However, finding a suitable motion model to leverage this information is a non-trivial task. In this paper we propose an unsupervised online learning approach based on a convolutional neural network (CNN) that estimates such a motion model individually for each frame. By relating forward and backward motion these learned models not only allow to infer valuable motion information based on the backward flow, they also help to improve the performance at occlusions, where a reliable prediction is particularly useful. Moreover, our learned models are spatially variant and hence allow to estimate non-rigid motion per construction. This, in turns, allows to overcome the major limitation of recent rigidity-based approaches that seek to improve the estimation by incorporating additional stereo/SfM constraints. Experiments demonstrate the usefulness of our new approach. They not only show a consistent improvement of up to 27% for all major benchmarks (KITTI 2012, KITTI 2015, MPI Sintel) compared to a baseline without prediction, they also show top results for the MPI Sintel benchmark -- the one of the three benchmarks that contains the largest amount of non-rigid motion.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes