CVLGROMay 29, 2020

Unconstrained Matching of 2D and 3D Descriptors for 6-DOF Pose Estimation

arXiv:2005.14502v11 citations
Originality Highly original
AI Analysis

This addresses the challenge of unconstrained camera localization in 3D environments for applications like robotics or augmented reality, representing a novel method rather than an incremental improvement.

The paper tackles the problem of directly matching 2D image descriptors with 3D point cloud descriptors for 6-DOF pose estimation, demonstrating that this approach is viable and achieves high precision in indoor and outdoor scenarios.

This paper proposes a novel concept to directly match feature descriptors extracted from 2D images with feature descriptors extracted from 3D point clouds. We use this concept to directly localize images in a 3D point cloud. We generate a dataset of matching 2D and 3D points and their corresponding feature descriptors, which is used to learn a Descriptor-Matcher classifier. To localize the pose of an image at test time, we extract keypoints and feature descriptors from the query image. The trained Descriptor-Matcher is then used to match the features from the image and the point cloud. The locations of the matched features are used in a robust pose estimation algorithm to predict the location and orientation of the query image. We carried out an extensive evaluation of the proposed method for indoor and outdoor scenarios and with different types of point clouds to verify the feasibility of our approach. Experimental results demonstrate that direct matching of feature descriptors from images and point clouds is not only a viable idea but can also be reliably used to estimate the 6-DOF poses of query cameras in any type of 3D point cloud in an unconstrained manner with high precision.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes