CVMLJun 4, 2018

Digging Into Self-Supervised Monocular Depth Estimation

arXiv:1806.01260v4494 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of scalable depth estimation for applications like autonomous driving, though it is incremental as it builds on existing self-supervised methods.

The paper tackled the problem of monocular depth estimation without ground-truth depth data by proposing a simple model with three improvements: a minimum reprojection loss for occlusions, a full-resolution multi-scale sampling method to reduce artifacts, and an auto-masking loss to ignore invalid pixels, achieving state-of-the-art results on the KITTI benchmark.

Per-pixel ground-truth depth data is challenging to acquire at scale. To overcome this limitation, self-supervised learning has emerged as a promising alternative for training models to perform monocular depth estimation. In this paper, we propose a set of improvements, which together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods. Research on self-supervised monocular training usually explores increasingly complex architectures, loss functions, and image formation models, all of which have recently helped to close the gap with fully-supervised methods. We show that a surprisingly simple model, and associated design choices, lead to superior predictions. In particular, we propose (i) a minimum reprojection loss, designed to robustly handle occlusions, (ii) a full-resolution multi-scale sampling method that reduces visual artifacts, and (iii) an auto-masking loss to ignore training pixels that violate camera motion assumptions. We demonstrate the effectiveness of each component in isolation, and show high quality, state-of-the-art results on the KITTI benchmark.

Code Implementations15 repos

Data from Papers with Code (CC-BY-SA-4.0)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes