CVSep 14, 2022

DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction

arXiv:2209.06351v439 citationsh-index: 57Has Code
Originality Incremental advance
AI Analysis

This work improves depth estimation accuracy for applications like robotics and autonomous driving, though it is incremental over existing self-supervised methods.

The paper tackles the problem of self-supervised monocular depth learning by addressing limitations in exploiting 3D geometric correspondences and handling photometric ambiguities, resulting in a 4% reduction in root-mean-square-deviation on KITTI-2015 and NYU-V2 datasets.

Self-supervised depth learning from monocular images normally relies on the 2D pixel-wise photometric relation between temporally adjacent image frames. However, they neither fully exploit the 3D point-wise geometric correspondences, nor effectively tackle the ambiguities in the photometric warping caused by occlusions or illumination inconsistency. To address these problems, this work proposes Density Volume Construction Network (DevNet), a novel self-supervised monocular depth learning framework, that can consider 3D spatial information, and exploit stronger geometric constraints among adjacent camera frustums. Instead of directly regressing the pixel value from a single image, our DevNet divides the camera frustum into multiple parallel planes and predicts the pointwise occlusion probability density on each plane. The final depth map is generated by integrating the density along corresponding rays. During the training process, novel regularization strategies and loss functions are introduced to mitigate photometric ambiguities and overfitting. Without obviously enlarging model parameters size or running time, DevNet outperforms several representative baselines on both the KITTI-2015 outdoor dataset and NYU-V2 indoor dataset. In particular, the root-mean-square-deviation is reduced by around 4% with DevNet on both KITTI-2015 and NYU-V2 in the task of depth estimation. Code is available at https://github.com/gitkaichenzhou/DevNet.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes