CVAIMar 14, 2021

Radar Camera Fusion via Representation Learning in Autonomous Driving

arXiv:2103.07825v357 citations
Originality Incremental advance
AI Analysis

This addresses the perception challenge in autonomous driving systems by improving sensor fusion accuracy, though it is incremental as it builds on existing fusion techniques.

The paper tackles the problem of accurate data association between radar and camera sensors in autonomous driving by proposing a deep representation learning method, achieving a 92.2% F1 score, which is 11.6% higher than a rule-based baseline.

Radars and cameras are mature, cost-effective, and robust sensors and have been widely used in the perception stack of mass-produced autonomous driving systems. Due to their complementary properties, outputs from radar detection (radar pins) and camera perception (2D bounding boxes) are usually fused to generate the best perception results. The key to successful radar-camera fusion is the accurate data association. The challenges in the radar-camera association can be attributed to the complexity of driving scenes, the noisy and sparse nature of radar measurements, and the depth ambiguity from 2D bounding boxes. Traditional rule-based association methods are susceptible to performance degradation in challenging scenarios and failure in corner cases. In this study, we propose to address radar-camera association via deep representation learning, to explore feature-level interaction and global reasoning. Additionally, we design a loss sampling mechanism and an innovative ordinal loss to overcome the difficulty of imperfect labeling and to enforce critical human-like reasoning. Despite being trained with noisy labels generated by a rule-based algorithm, our proposed method achieves a performance of 92.2% F1 score, which is 11.6% higher than the rule-based teacher. Moreover, this data-driven method also lends itself to continuous improvement via corner case mining.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes