CVApr 24, 2025

A Guide to Structureless Visual Localization

arXiv:2504.17636v15 citationsh-index: 6
Originality Synthesis-oriented
AI Analysis

This work addresses the need for more flexible visual localization systems in applications like self-driving cars and augmented reality, though it is incremental as it provides a comprehensive review and comparison rather than introducing a new method.

The paper tackles the problem of visual localization by comparing structureless methods, which use image databases for flexibility, against structure-based approaches, finding that classical geometric methods achieve higher accuracy than recent pose regression techniques but slightly lag behind structure-based methods in pose accuracy.

Visual localization algorithms, i.e., methods that estimate the camera pose of a query image in a known scene, are core components of many applications, including self-driving cars and augmented / mixed reality systems. State-of-the-art visual localization algorithms are structure-based, i.e., they store a 3D model of the scene and use 2D-3D correspondences between the query image and 3D points in the model for camera pose estimation. While such approaches are highly accurate, they are also rather inflexible when it comes to adjusting the underlying 3D model after changes in the scene. Structureless localization approaches represent the scene as a database of images with known poses and thus offer a much more flexible representation that can be easily updated by adding or removing images. Although there is a large amount of literature on structure-based approaches, there is significantly less work on structureless methods. Hence, this paper is dedicated to providing the, to the best of our knowledge, first comprehensive discussion and comparison of structureless methods. Extensive experiments show that approaches that use a higher degree of classical geometric reasoning generally achieve higher pose accuracy. In particular, approaches based on classical absolute or semi-generalized relative pose estimation outperform very recent methods based on pose regression by a wide margin. Compared with state-of-the-art structure-based approaches, the flexibility of structureless methods comes at the cost of (slightly) lower pose accuracy, indicating an interesting direction for future work.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes