CVApr 16, 2019

Complexer-YOLO: Real-Time 3D Object Detection and Tracking on Semantic Point Clouds

arXiv:1904.07537v1222 citations
Originality Incremental advance
AI Analysis

This addresses the problem of accurate and efficient 3D perception for autonomous cars, though it is incremental as it builds on existing methods.

The paper tackles 3D object detection for autonomous driving by fusing neural network-based 3D detection with visual semantic segmentation, achieving results comparable to state-of-the-art on KITTI while running in real-time.

Accurate detection of 3D objects is a fundamental problem in computer vision and has an enormous impact on autonomous cars, augmented/virtual reality and many applications in robotics. In this work we present a novel fusion of neural network based state-of-the-art 3D detector and visual semantic segmentation in the context of autonomous driving. Additionally, we introduce Scale-Rotation-Translation score (SRTs), a fast and highly parameterizable evaluation metric for comparison of object detections, which speeds up our inference time up to 20\% and halves training time. On top, we apply state-of-the-art online multi target feature tracking on the object measurements to further increase accuracy and robustness utilizing temporal information. Our experiments on KITTI show that we achieve same results as state-of-the-art in all related categories, while maintaining the performance and accuracy trade-off and still run in real-time. Furthermore, our model is the first one that fuses visual semantic with 3D object detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes