CVMar 28, 2022

REGTR: End-to-end Point Cloud Correspondences with Transformers

arXiv:2203.14517v129.5273 citationsh-index: 47Has Code

Originality Highly original

AI Analysis

This addresses the problem of efficient and accurate point cloud registration for robotics and computer vision applications, offering a novel approach that simplifies the pipeline.

The paper tackles point cloud registration by proposing an end-to-end framework that uses transformers to directly predict correspondences, eliminating the need for explicit feature matching and RANSAC, achieving state-of-the-art performance on 3DMatch and ModelNet benchmarks.

Despite recent success in incorporating learning into point cloud registration, many works focus on learning feature descriptors and continue to rely on nearest-neighbor feature matching and outlier filtering through RANSAC to obtain the final set of correspondences for pose estimation. In this work, we conjecture that attention mechanisms can replace the role of explicit feature matching and RANSAC, and thus propose an end-to-end framework to directly predict the final set of correspondences. We use a network architecture consisting primarily of transformer layers containing self and cross attentions, and train it to predict the probability each point lies in the overlapping region and its corresponding position in the other point cloud. The required rigid transformation can then be estimated directly from the predicted correspondences without further post-processing. Despite its simplicity, our approach achieves state-of-the-art performance on 3DMatch and ModelNet benchmarks. Our source code can be found at https://github.com/yewzijian/RegTR .

View on arXiv PDF Code

Similar