CVJan 13, 2025

RePoseD: Efficient Relative Pose Estimation With Known Depth Information

arXiv:2501.07742v34 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses the problem of improving camera pose estimation for computer vision applications, offering incremental advancements by integrating depth information into existing frameworks.

The paper tackles relative pose estimation by incorporating monocular depth estimates, proposing efficient solvers that jointly estimate scale or shift parameters with the pose, and shows they outperform state-of-the-art depth-aware methods in speed and accuracy across multiple datasets.

Recent advances in monocular depth estimation methods (MDE) and their improved accuracy open new possibilities for their applications. In this paper, we investigate how monocular depth estimates can be used for relative pose estimation. In particular, we are interested in answering the question whether using MDEs improves results over traditional point-based methods. We propose a novel framework for estimating the relative pose of two cameras from point correspondences with associated monocular depths. Since depth predictions are typically defined up to an unknown scale or even both unknown scale and shift parameters, our solvers jointly estimate the scale or both the scale and shift parameters along with the relative pose. We derive efficient solvers considering different types of depths for three camera configurations: (1) two calibrated cameras, (2) two cameras with an unknown shared focal length, and (3) two cameras with unknown different focal lengths. Our new solvers outperform state-of-the-art depth-aware solvers in terms of speed and accuracy. In extensive real experiments on multiple datasets and with various MDEs, we discuss which depth-aware solvers are preferable in which situation. The code will be made publicly available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes