CVSep 19, 2023

NDDepth: Normal-Distance Assisted Monocular Depth Estimation

arXiv:2309.10592v277 citationsh-index: 23
Originality Incremental advance
AI Analysis

This work addresses depth estimation for vision applications, offering incremental improvements through a novel hybrid method.

The paper tackles monocular depth estimation by proposing a physics-driven framework that assumes scenes are piece-wise planes, introducing a normal-distance head and a contrastive refinement module, achieving state-of-the-art results with top rankings on benchmarks like KITTI.

Monocular depth estimation has drawn widespread attention from the vision community due to its broad applications. In this paper, we propose a novel physics (geometry)-driven deep learning framework for monocular depth estimation by assuming that 3D scenes are constituted by piece-wise planes. Particularly, we introduce a new normal-distance head that outputs pixel-level surface normal and plane-to-origin distance for deriving depth at each position. Meanwhile, the normal and distance are regularized by a developed plane-aware consistency constraint. We further integrate an additional depth head to improve the robustness of the proposed framework. To fully exploit the strengths of these two heads, we develop an effective contrastive iterative refinement module that refines depth in a complementary manner according to the depth uncertainty. Extensive experiments indicate that the proposed method exceeds previous state-of-the-art competitors on the NYU-Depth-v2, KITTI and SUN RGB-D datasets. Notably, it ranks 1st among all submissions on the KITTI depth prediction online benchmark at the submission time.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes