CVLGMLDec 7, 2015

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

arXiv:1512.02134v12991 citations
Originality Incremental advance
AI Analysis

This work addresses the need for large-scale datasets to train and evaluate scene flow methods, which is a domain-specific advancement for computer vision.

The paper tackles the problem of training convolutional networks for disparity, optical flow, and scene flow estimation by introducing three large synthetic stereo video datasets, and it presents a network for real-time disparity estimation that achieves state-of-the-art results, along with the first convolutional network for scene flow estimation.

Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks. Our datasets are the first large-scale datasets to enable training and evaluating scene flow methods. Besides the datasets, we present a convolutional network for real-time disparity estimation that provides state-of-the-art results. By combining a flow and disparity estimation network and training it jointly, we demonstrate the first scene flow estimation with a convolutional network.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes