CVIVAug 7, 2018

YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud

arXiv:1808.02350v1169 citations
Originality Incremental advance
AI Analysis

This addresses the need for fast 3D object detection in automated driving, though it is incremental as it builds on existing 2D methods.

The paper tackles real-time 3D oriented object bounding box detection from LiDAR point clouds for automated driving by extending the YOLO v2 loss function to include yaw angle, 3D box center, and height, achieving 40 fps on a Titan X GPU on the KITTI benchmark.

Object detection and classification in 3D is a key task in Automated Driving (AD). LiDAR sensors are employed to provide the 3D point cloud reconstruction of the surrounding environment, while the task of 3D object bounding box detection in real time remains a strong algorithmic challenge. In this paper, we build on the success of the one-shot regression meta-architecture in the 2D perspective image space and extend it to generate oriented 3D object bounding boxes from LiDAR point cloud. Our main contribution is in extending the loss function of YOLO v2 to include the yaw angle, the 3D box center in Cartesian coordinates and the height of the box as a direct regression problem. This formulation enables real-time performance, which is essential for automated driving. Our results are showing promising figures on KITTI benchmark, achieving real-time performance (40 fps) on Titan X GPU.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes