CVLGNESep 15, 2014

Computing the Stereo Matching Cost with a Convolutional Neural Network

arXiv:1409.4326v2816 citations
Originality Highly original
AI Analysis

This work addresses depth extraction for computer vision applications, representing an incremental improvement in stereo matching methods.

The paper tackled stereo depth estimation by training a convolutional neural network to compute matching costs, achieving an error rate of 2.61% on the KITTI dataset and top performance at the time.

We present a method for extracting depth information from a rectified image pair. We train a convolutional neural network to predict how well two image patches match and use it to compute the stereo matching cost. The cost is refined by cross-based cost aggregation and semiglobal matching, followed by a left-right consistency check to eliminate errors in the occluded regions. Our stereo method achieves an error rate of 2.61 % on the KITTI stereo dataset and is currently (August 2014) the top performing method on this dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes