CVApr 12, 2025

RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection

arXiv:2504.09086v19 citationsh-index: 6Has CodeCVPR
Originality Incremental advance
AI Analysis

This work addresses the problem of robust 3D object detection in autonomous driving by enhancing radar-camera fusion, representing an incremental improvement over existing methods.

The paper tackles the challenge of radar-camera fusion for 3D object detection by explicitly modeling radar hit distributions to improve matching, achieving state-of-the-art performance on the nuScenes dataset.

Radar hits reflect from points on both the boundary and internal to object outlines. This results in a complex distribution of radar hits that depends on factors including object category, size, and orientation. Current radar-camera fusion methods implicitly account for this with a black-box neural network. In this paper, we explicitly utilize a radar hit distribution model to assist fusion. First, we build a model to predict radar hit distributions conditioned on object properties obtained from a monocular detector. Second, we use the predicted distribution as a kernel to match actual measured radar points in the neighborhood of the monocular detections, generating matching scores at nearby positions. Finally, a fusion stage combines context with the kernel detector to refine the matching scores. Our method achieves the state-of-the-art radar-camera detection performance on nuScenes. Our source code is available at https://github.com/longyunf/riccardo.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes