CVAug 7, 2022

Global Hierarchical Attention for 3D Point Cloud Analysis

arXiv:2208.03791v1h-index: 19
Originality Incremental advance
AI Analysis

This addresses the problem of scaling attention mechanisms for large point clouds in computer vision, offering incremental improvements to existing methods.

The paper tackles the challenge of efficient global attention in 3D point cloud analysis by proposing Global Hierarchical Attention (GHA), which achieves linear complexity and improves performance, such as a +1.7% mIoU increase for semantic segmentation on ScanNet and up to +2.1% mAP25 for object detection.

We propose a new attention mechanism, called Global Hierarchical Attention (GHA), for 3D point cloud analysis. GHA approximates the regular global dot-product attention via a series of coarsening and interpolation operations over multiple hierarchy levels. The advantage of GHA is two-fold. First, it has linear complexity with respect to the number of points, enabling the processing of large point clouds. Second, GHA inherently possesses the inductive bias to focus on spatially close points, while retaining the global connectivity among all points. Combined with a feedforward network, GHA can be inserted into many existing network architectures. We experiment with multiple baseline networks and show that adding GHA consistently improves performance across different tasks and datasets. For the task of semantic segmentation, GHA gives a +1.7% mIoU increase to the MinkowskiEngine baseline on ScanNet. For the 3D object detection task, GHA improves the CenterPoint baseline by +0.5% mAP on the nuScenes dataset, and the 3DETR baseline by +2.1% mAP25 and +1.5% mAP50 on ScanNet.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes