RO CVFeb 1, 2020

Leveraging Uncertainties for Deep Multi-modal Object Detection in Autonomous Driving

Di Feng, Yifan Cao, Lars Rosenbaum, Fabian Timm, Klaus Dietmayer

arXiv:2002.00216v115.724 citations

Originality Incremental advance

AI Analysis

This addresses robust perception for autonomous vehicles, with incremental improvements in fusion methods.

This paper tackles 3D object detection in autonomous driving by combining LiDAR and camera data with uncertainty modeling, achieving up to 7% higher Average Precision than baselines and up to 20% improvement under sensor misalignment.

This work presents a probabilistic deep neural network that combines LiDAR point clouds and RGB camera images for robust, accurate 3D object detection. We explicitly model uncertainties in the classification and regression tasks, and leverage uncertainties to train the fusion network via a sampling mechanism. We validate our method on three datasets with challenging real-world driving scenarios. Experimental results show that the predicted uncertainties reflect complex environmental uncertainty like difficulties of a human expert to label objects. The results also show that our method consistently improves the Average Precision by up to 7% compared to the baseline method. When sensors are temporally misaligned, the sampling method improves the Average Precision by up to 20%, showing its high robustness against noisy sensor inputs.

View on arXiv PDF

Similar