CVDec 14, 2023

iComMa: Inverting 3D Gaussian Splatting for Camera Pose Estimation via Comparing and Matching

arXiv:2312.09031v215 citationsh-index: 8
AI Analysis

This addresses camera pose estimation for computer vision applications, offering an incremental improvement over existing methods by enhancing robustness to large rotations and translations.

The paper tackles the 6D camera pose estimation problem by inverting 3D Gaussian Splatting to optimize camera pose without training, achieving accurate results in challenging conditions with adverse initializations.

We present a method named iComMa to address the 6D camera pose estimation problem in computer vision. Conventional pose estimation methods typically rely on the target's CAD model or necessitate specific network training tailored to particular object classes. Some existing methods have achieved promising results in mesh-free object and scene pose estimation by inverting the Neural Radiance Fields (NeRF). However, they still struggle with adverse initializations such as large rotations and translations. To address this issue, we propose an efficient method for accurate camera pose estimation by inverting 3D Gaussian Splatting (3DGS). Specifically, a gradient-based differentiable framework optimizes camera pose by minimizing the residual between the query image and the rendered image, requiring no training. An end-to-end matching module is designed to enhance the model's robustness against adverse initializations, while minimizing pixel-level comparing loss aids in precise pose estimation. Experimental results on synthetic and complex real-world data demonstrate the effectiveness of the proposed approach in challenging conditions and the accuracy of camera pose estimation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes