CVMar 18, 2019

Bilateral Cyclic Constraint and Adaptive Regularization for Unsupervised Monocular Depth Prediction

arXiv:1903.07309v397 citations
Originality Highly original
AI Analysis

This addresses the problem of costly supervised depth estimation for applications like autonomous driving by providing an unsupervised method that reduces the need for per-pixel ground-truth data.

The paper tackles unsupervised monocular depth prediction by training a network with stereo pairs but using only a single image at test time, introducing a bilateral cyclic constraint and adaptive regularization to handle occlusions, and it outperforms state-of-the-art methods on KITTI benchmarks with demonstrated generalization to the Make3d dataset.

Supervised learning methods to infer (hypothesize) depth of a scene from a single image require costly per-pixel ground-truth. We follow a geometric approach that exploits abundant stereo imagery to learn a model to hypothesize scene structure without direct supervision. Although we train a network with stereo pairs, we only require a single image at test time to hypothesize disparity or depth. We propose a novel objective function that exploits the bilateral cyclic relationship between the left and right disparities and we introduce an adaptive regularization scheme that allows the network to handle both the co-visible and occluded regions in a stereo pair. This process ultimately produces a model to generate hypotheses for the 3-dimensional structure of the scene as viewed in a single image. When used to generate a single (most probable) estimate of depth, our method outperforms state-of-the-art unsupervised monocular depth prediction methods on the KITTI benchmarks. We show that our method generalizes well by applying our models trained on KITTI to the Make3d dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes