CVLGMLFeb 18, 2019

A Generative Map for Image-based Camera Localization

arXiv:1902.11124v43 citations
Originality Incremental advance
AI Analysis

This addresses the need for interpretable maps in camera localization systems, which is incremental as it builds on existing neural network approaches.

The paper tackles the problem of creating human-readable neural network maps for image-based camera localization by proposing a Generative Map framework that combines a generative model with a Kalman filter, achieving comparable localization performance to current regression models on the 7-Scenes and Oxford RobotCar datasets.

In image-based camera localization systems, information about the environment is usually stored in some representation, which can be referred to as a map. Conventionally, most maps are built upon hand-crafted features. Recently, neural networks have attracted attention as a data-driven map representation, and have shown promising results in visual localization. However, these neural network maps are generally hard to interpret by human. A readable map is not only accessible to humans, but also provides a way to be verified when the ground truth pose is unavailable. To tackle this problem, we propose Generative Map, a new framework for learning human-readable neural network maps, by combining a generative model with the Kalman filter, which also allows it to incorporate additional sensor information such as stereo visual odometry. For evaluation, we use real world images from the 7-Scenes and Oxford RobotCar datasets. We demonstrate that our Generative Map can be queried with a pose of interest from the test sequence to predict an image, which closely resembles the true scene. For localization, we show that Generative Map achieves comparable performance with current regression models. Moreover, our framework is trained completely from scratch, unlike regression models which rely on large ImageNet pretrained networks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes