ROCVMar 18, 2024

BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation

arXiv:2403.11761v248 citationsh-index: 40Has CodeIROS
Originality Incremental advance
AI Analysis

This work addresses a critical challenge for mobile robots in autonomous driving by providing a cost-effective alternative to LiDAR for enhanced perception in poor weather, though it is incremental as it builds on existing fusion approaches.

The paper tackles the problem of semantic scene segmentation from a bird's-eye-view perspective under adverse conditions like rain or nighttime by fusing camera and radar data, introducing BEVCar, which outperforms state-of-the-art methods on the nuScenes dataset and improves robustness and performance for distant objects.

Semantic scene segmentation from a bird's-eye-view (BEV) perspective plays a crucial role in facilitating planning and decision-making for mobile robots. Although recent vision-only methods have demonstrated notable advancements in performance, they often struggle under adverse illumination conditions such as rain or nighttime. While active sensors offer a solution to this challenge, the prohibitively high cost of LiDARs remains a limiting factor. Fusing camera data with automotive radars poses a more inexpensive alternative but has received less attention in prior research. In this work, we aim to advance this promising avenue by introducing BEVCar, a novel approach for joint BEV object and map segmentation. The core novelty of our approach lies in first learning a point-based encoding of raw radar data, which is then leveraged to efficiently initialize the lifting of image features into the BEV space. We perform extensive experiments on the nuScenes dataset and demonstrate that BEVCar outperforms the current state of the art. Moreover, we show that incorporating radar information significantly enhances robustness in challenging environmental conditions and improves segmentation performance for distant objects. To foster future research, we provide the weather split of the nuScenes dataset used in our experiments, along with our code and trained models at http://bevcar.cs.uni-freiburg.de.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes