CVSep 17, 2025

Gaussian Alignment for Relative Camera Pose Estimation via Single-View Reconstruction

arXiv:2509.13652v1AI
Originality Highly original
AI Analysis

This addresses the problem of metric pose estimation for 3D reconstruction and localization, offering a novel approach that bridges single-view perception with multi-view geometry.

The paper tackles metric relative camera pose estimation from image pairs by introducing GARPS, a training-free framework that aligns independently reconstructed 3D Gaussian Mixture Models, outperforming state-of-the-art methods on the Real-Estate10K dataset.

Estimating metric relative camera pose from a pair of images is of great importance for 3D reconstruction and localisation. However, conventional two-view pose estimation methods are not metric, with camera translation known only up to a scale, and struggle with wide baselines and textureless or reflective surfaces. This paper introduces GARPS, a training-free framework that casts this problem as the direct alignment of two independently reconstructed 3D scenes. GARPS leverages a metric monocular depth estimator and a Gaussian scene reconstructor to obtain a metric 3D Gaussian Mixture Model (GMM) for each image. It then refines an initial pose from a feed-forward two-view pose estimator by optimising a differentiable GMM alignment objective. This objective jointly considers geometric structure, view-independent colour, anisotropic covariance, and semantic feature consistency, and is robust to occlusions and texture-poor regions without requiring explicit 2D correspondences. Extensive experiments on the Real\-Estate10K dataset demonstrate that GARPS outperforms both classical and state-of-the-art learning-based methods, including MASt3R. These results highlight the potential of bridging single-view perception with multi-view geometry to achieve robust and metric relative pose estimation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes