CVDec 4, 2017

Learning for Disparity Estimation through Feature Constancy

arXiv:1712.01039v2131 citations
Originality Incremental advance
AI Analysis

This work addresses stereo matching for computer vision applications, presenting an incremental improvement by integrating all steps into a single network.

The authors tackled the problem of stereo matching by proposing a network architecture that incorporates all four steps of the process, achieving state-of-the-art performance on the KITTI 2012 and KITTI 2015 benchmarks with fast running times.

Stereo matching algorithms usually consist of four steps, including matching cost calculation, matching cost aggregation, disparity calculation, and disparity refinement. Existing CNN-based methods only adopt CNN to solve parts of the four steps, or use different networks to deal with different steps, making them difficult to obtain the overall optimal solution. In this paper, we propose a network architecture to incorporate all steps of stereo matching. The network consists of three parts. The first part calculates the multi-scale shared features. The second part performs matching cost calculation, matching cost aggregation and disparity calculation to estimate the initial disparity using shared features. The initial disparity and the shared features are used to calculate the feature constancy that measures correctness of the correspondence between two input images. The initial disparity and the feature constancy are then fed to a sub-network to refine the initial disparity. The proposed method has been evaluated on the Scene Flow and KITTI datasets. It achieves the state-of-the-art performance on the KITTI 2012 and KITTI 2015 benchmarks while maintaining a very fast running time.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes