ROCVFeb 1, 2020

Leveraging Uncertainties for Deep Multi-modal Object Detection in Autonomous Driving

arXiv:2002.00216v124 citations
AI Analysis

This addresses robust perception for autonomous vehicles, with incremental improvements in fusion methods.

This paper tackles 3D object detection in autonomous driving by combining LiDAR and camera data with uncertainty modeling, achieving up to 7% higher Average Precision than baselines and up to 20% improvement under sensor misalignment.

This work presents a probabilistic deep neural network that combines LiDAR point clouds and RGB camera images for robust, accurate 3D object detection. We explicitly model uncertainties in the classification and regression tasks, and leverage uncertainties to train the fusion network via a sampling mechanism. We validate our method on three datasets with challenging real-world driving scenarios. Experimental results show that the predicted uncertainties reflect complex environmental uncertainty like difficulties of a human expert to label objects. The results also show that our method consistently improves the Average Precision by up to 7% compared to the baseline method. When sensors are temporally misaligned, the sampling method improves the Average Precision by up to 20%, showing its high robustness against noisy sensor inputs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes