CVSep 24, 2018

Sparse-to-Continuous: Enhancing Monocular Depth Estimation using Occupancy Maps

arXiv:1809.09061v319 citations
Originality Incremental advance
AI Analysis

It addresses depth estimation for outdoor scenes where LiDAR data is sparse, offering an incremental enhancement to training data quality.

This paper tackles the problem of single image depth estimation by proposing a densification method for sparse LiDAR depth maps using Hilbert Maps, which improves depth prediction quality without modifying the neural network architecture. Experiments on KITTI dataset subsets show significant improvement, though no concrete numbers are provided.

This paper addresses the problem of single image depth estimation (SIDE), focusing on improving the quality of deep neural network predictions. In a supervised learning scenario, the quality of predictions is intrinsically related to the training labels, which guide the optimization process. For indoor scenes, structured-light-based depth sensors (e.g. Kinect) are able to provide dense, albeit short-range, depth maps. On the other hand, for outdoor scenes, LiDARs are considered the standard sensor, which comparatively provides much sparser measurements, especially in areas further away. Rather than modifying the neural network architecture to deal with sparse depth maps, this article introduces a novel densification method for depth maps, using the Hilbert Maps framework. A continuous occupancy map is produced based on 3D points from LiDAR scans, and the resulting reconstructed surface is projected into a 2D depth map with arbitrary resolution. Experiments conducted with various subsets of the KITTI dataset show a significant improvement produced by the proposed Sparse-to-Continuous technique, without the introduction of extra information into the training stage.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes