CVROSep 27, 2017

Globally-Optimal Inlier Set Maximisation for Simultaneous Camera Pose and Feature Correspondence

arXiv:1709.09384v161 citations
Originality Highly original
AI Analysis

This addresses the challenge of camera pose estimation in computer vision, particularly in scenarios with many outliers, offering a globally-optimal solution without needing good initializations, though it is incremental in improving robustness over prior methods.

The paper tackles the problem of simultaneously estimating camera pose and feature correspondences from a single image and 3D point-set, proposing a globally-optimal method that maximizes inlier set cardinality using branch-and-bound, which outperforms existing approaches in robustness and optimality.

Estimating the 6-DoF pose of a camera from a single image relative to a pre-computed 3D point-set is an important task for many computer vision applications. Perspective-n-Point (PnP) solvers are routinely used for camera pose estimation, provided that a good quality set of 2D-3D feature correspondences are known beforehand. However, finding optimal correspondences between 2D key-points and a 3D point-set is non-trivial, especially when only geometric (position) information is known. Existing approaches to the simultaneous pose and correspondence problem use local optimisation, and are therefore unlikely to find the optimal solution without a good pose initialisation, or introduce restrictive assumptions. Since a large proportion of outliers are common for this problem, we instead propose a globally-optimal inlier set cardinality maximisation approach which jointly estimates optimal camera pose and optimal correspondences. Our approach employs branch-and-bound to search the 6D space of camera poses, guaranteeing global optimality without requiring a pose prior. The geometry of SE(3) is used to find novel upper and lower bounds for the number of inliers and local optimisation is integrated to accelerate convergence. The evaluation empirically supports the optimality proof and shows that the method performs much more robustly than existing approaches, including on a large-scale outdoor data-set.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes