CVMar 8, 2022

A Lightweight and Detector-free 3D Single Object Tracker on Point Clouds

Yan Xia, Qiangqiang Wu, Wei Li, Antoni B. Chan, Uwe Stilla

arXiv:2203.04232v215.356 citationsh-index: 55Has Code

Originality Highly original

AI Analysis

This work addresses the challenge of accurate 3D tracking for autonomous driving systems by offering a faster and more efficient alternative to detector-based approaches.

The paper tackles the problem of 3D single object tracking in sparse and incomplete point clouds by proposing a detector-free method that leverages temporal motion cues, resulting in a 10% performance improvement on the NuScenes dataset and a tracking speed of 72 FPS.

Recent works on 3D single object tracking treat the task as a target-specific 3D detection task, where an off-the-shelf 3D detector is commonly employed for the tracking. However, it is non-trivial to perform accurate target-specific detection since the point cloud of objects in raw LiDAR scans is usually sparse and incomplete. In this paper, we address this issue by explicitly leveraging temporal motion cues and propose DMT, a Detector-free Motion-prediction-based 3D Tracking network that completely removes the usage of complicated 3D detectors and is lighter, faster, and more accurate than previous trackers. Specifically, the motion prediction module is first introduced to estimate a potential target center of the current frame in a point-cloud-free manner. Then, an explicit voting module is proposed to directly regress the 3D box from the estimated target center. Extensive experiments on KITTI and NuScenes datasets demonstrate that our DMT can still achieve better performance (~10% improvement over the NuScenes dataset) and a faster tracking speed (i.e., 72 FPS) than state-of-the-art approaches without applying any complicated 3D detectors. Our code is released at \url{https://github.com/jimmy-dq/DMT}

View on arXiv PDF Code

Similar