CVROSep 11, 2024

FaVoR: Features via Voxel Rendering for Camera Relocalization

arXiv:2409.07571v41 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses robustness issues in feature-based camera relocalization for applications like robotics and AR/VR, though it is an incremental improvement over existing sparse feature matching approaches.

The paper tackles camera relocalization under significant viewpoint and appearance changes by proposing a method that uses a sparse voxel map to render image patch descriptors for unseen views, achieving up to 39% improvement in median translation error on indoor datasets and comparable outdoor results with lower costs.

Camera relocalization methods range from dense image alignment to direct camera pose regression from a query image. Among these, sparse feature matching stands out as an efficient, versatile, and generally lightweight approach with numerous applications. However, feature-based methods often struggle with significant viewpoint and appearance changes, leading to matching failures and inaccurate pose estimates. To overcome this limitation, we propose a novel approach that leverages a globally sparse yet locally dense 3D representation of 2D features. By tracking and triangulating landmarks over a sequence of frames, we construct a sparse voxel map optimized to render image patch descriptors observed during tracking. Given an initial pose estimate, we first synthesize descriptors from the voxels using volumetric rendering and then perform feature matching to estimate the camera pose. This methodology enables the generation of descriptors for unseen views, enhancing robustness to view changes. We extensively evaluate our method on the 7-Scenes and Cambridge Landmarks datasets. Our results show that our method significantly outperforms existing state-of-the-art feature representation techniques in indoor environments, achieving up to a 39% improvement in median translation error. Additionally, our approach yields comparable results to other methods for outdoor scenarios while maintaining lower memory and computational costs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes