CVLGMLApr 24, 2020

Expanding Sparse Guidance for Stereo Matching

arXiv:2005.02123v1
Originality Incremental advance
AI Analysis

This addresses the challenge of obtaining dense supervision for stereo estimation, offering a domain-adaptation solution for computer vision applications.

The paper tackles the problem of stereo matching by using sparse LiDAR cues to enhance features, reducing disparity errors by over 2 pixels on KITTI Stereo 2012 and 3 pixels on KITTI Stereo 2015.

The performance of image based stereo estimation suffers from lighting variations, repetitive patterns and homogeneous appearance. Moreover, to achieve good performance, stereo supervision requires sufficient densely-labeled data, which are hard to obtain. In this work, we leverage small amount of data with very sparse but accurate disparity cues from LiDAR to bridge the gap. We propose a novel sparsity expansion technique to expand the sparse cues concerning RGB images for local feature enhancement. The feature enhancement method can be easily applied to any stereo estimation algorithms with cost volume at the test stage. Extensive experiments on stereo datasets demonstrate the effectiveness and robustness across different backbones on domain adaption and self-supervision scenario. Our sparsity expansion method outperforms previous methods in terms of disparity by more than 2 pixel error on KITTI Stereo 2012 and 3 pixel error on KITTI Stereo 2015. Our approach significantly boosts the existing state-of-the-art stereo algorithms with extremely sparse cues.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes