CVJun 5, 2023

MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences

arXiv:2306.03206v120 citationsh-index: 30
Originality Highly original
AI Analysis

This work addresses the problem of 3D object detection in autonomous driving by enhancing performance for occluded and distant objects, representing an incremental improvement through a novel fusion approach.

The paper tackles the challenge of detecting occluded and long-range objects in 3D point cloud sequences by proposing MoDAR, which uses motion forecasting to create virtual points that augment LiDAR data, significantly improving prior detectors on the Waymo Open Dataset with minimal computational overhead.

Occluded and long-range objects are ubiquitous and challenging for 3D object detection. Point cloud sequence data provide unique opportunities to improve such cases, as an occluded or distant object can be observed from different viewpoints or gets better visibility over time. However, the efficiency and effectiveness in encoding long-term sequence data can still be improved. In this work, we propose MoDAR, using motion forecasting outputs as a type of virtual modality, to augment LiDAR point clouds. The MoDAR modality propagates object information from temporal contexts to a target frame, represented as a set of virtual points, one for each object from a waypoint on a forecasted trajectory. A fused point cloud of both raw sensor points and the virtual points can then be fed to any off-the-shelf point-cloud based 3D object detector. Evaluated on the Waymo Open Dataset, our method significantly improves prior art detectors by using motion forecasting from extra-long sequences (e.g. 18 seconds), achieving new state of the arts, while not adding much computation overhead.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes