CVJul 27, 2021

DV-Det: Efficient 3D Point Cloud Object Detection with Dynamic Voxelization

arXiv:2107.12707v1
Originality Highly original
AI Analysis

This addresses the problem of slow 3D object detection for autonomous driving applications, offering a novel method for efficiency.

The paper tackles efficient 3D point cloud object detection by proposing a two-stage framework with dynamic voxelization, achieving 75 FPS on KITTI and 25 FPS on Waymo with satisfactory accuracy.

In this work, we propose a novel two-stage framework for the efficient 3D point cloud object detection. Instead of transforming point clouds into 2D bird eye view projections, we parse the raw point cloud data directly in the 3D space yet achieve impressive efficiency and accuracy. To achieve this goal, we propose dynamic voxelization, a method that voxellizes points at local scale on-the-fly. By doing so, we preserve the point cloud geometry with 3D voxels, and therefore waive the dependence on expensive MLPs to learn from point coordinates. On the other hand, we inherently still follow the same processing pattern as point-wise methods (e.g., PointNet) and no longer suffer from the quantization issue like conventional convolutions. For further speed optimization, we propose the grid-based downsampling and voxelization method, and provide different CUDA implementations to accommodate to the discrepant requirements during training and inference phases. We highlight our efficiency on KITTI 3D object detection dataset with 75 FPS and on Waymo Open dataset with 25 FPS inference speed with satisfactory accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes