CVROAug 19, 2021

Retrieval and Localization with Observation Constraints

arXiv:2108.08516v111 citations
Originality Incremental advance
AI Analysis

This addresses accurate localization for applications like augmented reality and robotics, but it is incremental as it modifies existing modules in a pipeline.

The paper tackles visual re-localization by proposing RLOCS, an integrated method combining image retrieval, semantic consistency, and geometry verification, which achieves performance improvements on benchmarks like Aachen Day-Night and InLoc.

Accurate visual re-localization is very critical to many artificial intelligence applications, such as augmented reality, virtual reality, robotics and autonomous driving. To accomplish this task, we propose an integrated visual re-localization method called RLOCS by combining image retrieval, semantic consistency and geometry verification to achieve accurate estimations. The localization pipeline is designed as a coarse-to-fine paradigm. In the retrieval part, we cascade the architecture of ResNet101-GeM-ArcFace and employ DBSCAN followed by spatial verification to obtain a better initial coarse pose. We design a module called observation constraints, which combines geometry information and semantic consistency for filtering outliers. Comprehensive experiments are conducted on open datasets, including retrieval on R-Oxford5k and R-Paris6k, semantic segmentation on Cityscapes, localization on Aachen Day-Night and InLoc. By creatively modifying separate modules in the total pipeline, our method achieves many performance improvements on the challenging localization benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes