CVJan 15, 2024

CascadeV-Det: Cascade Point Voting for 3D Object Detection

arXiv:2401.07477v11 citationsh-index: 3Has Code
Originality Incremental advance
AI Analysis

This work improves 3D object detection accuracy for applications like robotics and autonomous driving, though it is incremental as it builds on existing anchor-free methods.

The paper tackles the challenge of inaccurate bounding box regression in anchor-free 3D object detectors due to points being far from ground truth centers, proposing a Cascade Voting strategy that achieves state-of-the-art results with 70.4% mAP@0.25 and 51.6% mAP@0.5 on SUN RGB-D.

Anchor-free object detectors are highly efficient in performing point-based prediction without the need for extra post-processing of anchors. However, different from the 2D grids, the 3D points used in these detectors are often far from the ground truth center, making it challenging to accurately regress the bounding boxes. To address this issue, we propose a Cascade Voting (CascadeV) strategy that provides high-quality 3D object detection with point-based prediction. Specifically, CascadeV performs cascade detection using a novel Cascade Voting decoder that combines two new components: Instance Aware Voting (IA-Voting) and a Cascade Point Assignment (CPA) module. The IA-Voting module updates the object features of updated proposal points within the bounding box using conditional inverse distance weighting. This approach prevents features from being aggregated outside the instance and helps improve the accuracy of object detection. Additionally, since model training can suffer from a lack of proposal points with high centerness, we have developed the CPA module to narrow down the positive assignment threshold with cascade stages. This approach relaxes the dependence on proposal centerness in the early stages while ensuring an ample quantity of positives with high centerness in the later stages. Experiments show that FCAF3D with our CascadeV achieves state-of-the-art 3D object detection results with 70.4\% mAP@0.25 and 51.6\% mAP@0.5 on SUN RGB-D and competitive results on ScanNet. Code will be released at https://github.com/Sharpiless/CascadeV-Det

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes