CVSep 12, 2022

CenterFormer: Center-based Transformer for 3D Object Detection

arXiv:2209.05588v1198 citationsh-index: 75Has Code
Originality Highly original
AI Analysis

This addresses the problem of efficient and accurate 3D object detection for autonomous driving systems, representing a strong specific gain rather than a foundational advancement.

The paper tackles 3D object detection from LiDAR point clouds by proposing CenterFormer, a center-based transformer network that reduces computational complexity and convergence difficulty, achieving state-of-the-art performance with 73.7% mAPH on the Waymo validation set and 75.6% mAPH on the test set.

Query-based transformer has shown great potential in constructing long-range attention in many image-domain tasks, but has rarely been considered in LiDAR-based 3D object detection due to the overwhelming size of the point cloud data. In this paper, we propose CenterFormer, a center-based transformer network for 3D object detection. CenterFormer first uses a center heatmap to select center candidates on top of a standard voxel-based point cloud encoder. It then uses the feature of the center candidate as the query embedding in the transformer. To further aggregate features from multiple frames, we design an approach to fuse features through cross-attention. Lastly, regression heads are added to predict the bounding box on the output center feature representation. Our design reduces the convergence difficulty and computational complexity of the transformer structure. The results show significant improvements over the strong baseline of anchor-free object detection networks. CenterFormer achieves state-of-the-art performance for a single model on the Waymo Open Dataset, with 73.7% mAPH on the validation set and 75.6% mAPH on the test set, significantly outperforming all previously published CNN and transformer-based methods. Our code is publicly available at https://github.com/TuSimple/centerformer

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes