CVMar 28, 2022

REGTR: End-to-end Point Cloud Correspondences with Transformers

arXiv:2203.14517v1273 citationsh-index: 43Has Code
Originality Highly original
AI Analysis

This addresses the problem of efficient and accurate point cloud registration for robotics and computer vision applications, offering a novel approach that simplifies the pipeline.

The paper tackles point cloud registration by proposing an end-to-end framework that uses transformers to directly predict correspondences, eliminating the need for explicit feature matching and RANSAC, achieving state-of-the-art performance on 3DMatch and ModelNet benchmarks.

Despite recent success in incorporating learning into point cloud registration, many works focus on learning feature descriptors and continue to rely on nearest-neighbor feature matching and outlier filtering through RANSAC to obtain the final set of correspondences for pose estimation. In this work, we conjecture that attention mechanisms can replace the role of explicit feature matching and RANSAC, and thus propose an end-to-end framework to directly predict the final set of correspondences. We use a network architecture consisting primarily of transformer layers containing self and cross attentions, and train it to predict the probability each point lies in the overlapping region and its corresponding position in the other point cloud. The required rigid transformation can then be estimated directly from the predicted correspondences without further post-processing. Despite its simplicity, our approach achieves state-of-the-art performance on 3DMatch and ModelNet benchmarks. Our source code can be found at https://github.com/yewzijian/RegTR .

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes