CVJan 22, 2020

ResDepth: Learned Residual Stereo Reconstruction

arXiv:2001.08026v30.0025 citations
AI Analysis55

This work provides a significant improvement in stereo reconstruction accuracy for applications like satellite imaging, though it is incremental as it builds on existing stereo matchers.

The paper tackles the problem of dense stereo reconstruction by proposing a residual learning scheme that enhances approximate reconstructions using a deep network, achieving a >50% reduction in mean absolute error on satellite stereo and the ETH3D benchmark.

We propose an embarrassingly simple but very effective scheme for high-quality dense stereo reconstruction: (i) generate an approximate reconstruction with your favourite stereo matcher; (ii) rewarp the input images with that approximate model; (iii) with the initial reconstruction and the warped images as input, train a deep network to enhance the reconstruction by regressing a residual correction; and (iv) if desired, iterate the refinement with the new, improved reconstruction. The strategy to only learn the residual greatly simplifies the learning problem. A standard Unet without bells and whistles is enough to reconstruct even small surface details, like dormers and roof substructures in satellite images. We also investigate residual reconstruction with less information and find that even a single image is enough to greatly improve an approximate reconstruction. Our full model reduces the mean absolute error of state-of-the-art stereo reconstruction systems by >50%, both in our target domain of satellite stereo and on stereo pairs from the ETH3D benchmark.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes