CVDec 15, 2017

Semantic Visual Localization

arXiv:1712.05773v2277 citations
Originality Highly original
AI Analysis

This addresses the problem of life-long localization for applications like augmented reality or autonomous robots, with incremental improvements in handling difficult cases.

The paper tackles robust visual localization under extreme viewing conditions by proposing a method based on joint 3D geometric and semantic understanding, achieving reliable localization on challenging datasets.

Robust visual localization under a wide range of viewing conditions is a fundamental problem in computer vision. Handling the difficult cases of this problem is not only very challenging but also of high practical relevance, e.g., in the context of life-long localization for augmented reality or autonomous robots. In this paper, we propose a novel approach based on a joint 3D geometric and semantic understanding of the world, enabling it to succeed under conditions where previous approaches failed. Our method leverages a novel generative model for descriptor learning, trained on semantic scene completion as an auxiliary task. The resulting 3D descriptors are robust to missing observations by encoding high-level 3D geometric and semantic information. Experiments on several challenging large-scale localization datasets demonstrate reliable localization under extreme viewpoint, illumination, and geometry changes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes