CVDec 6, 2016

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

arXiv:1612.01925v13362 citations
Originality Incremental advance
AI Analysis

This improves optical flow estimation for computer vision applications like video analysis and robotics, representing a significant but incremental advancement over prior deep learning methods.

The paper tackled optical flow estimation by advancing end-to-end deep learning approaches, achieving over 50% error reduction compared to the original FlowNet while maintaining similar speed and matching state-of-the-art methods at interactive frame rates.

The FlowNet demonstrated that optical flow estimation can be cast as a learning problem. However, the state of the art with regard to the quality of the flow has still been defined by traditional methods. Particularly on small displacements and real-world data, FlowNet cannot compete with variational methods. In this paper, we advance the concept of end-to-end learning of optical flow and make it work really well. The large improvements in quality and speed are caused by three major contributions: first, we focus on the training data and show that the schedule of presenting data during training is very important. Second, we develop a stacked architecture that includes warping of the second image with intermediate optical flow. Third, we elaborate on small displacements by introducing a sub-network specializing on small motions. FlowNet 2.0 is only marginally slower than the original FlowNet but decreases the estimation error by more than 50%. It performs on par with state-of-the-art methods, while running at interactive frame rates. Moreover, we present faster variants that allow optical flow computation at up to 140fps with accuracy matching the original FlowNet.

Code Implementations12 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes