CV LG ROMay 19, 2020

Differentiable Mapping Networks: Learning Structured Map Representations for Sparse Visual Localization

Peter Karkus, Anelia Angelova, Vincent Vanhoucke, Rico Jonschkowski

arXiv:2005.09530v15.85 citations

Originality Incremental advance

AI Analysis

This work addresses mapping and localization for robotics, offering a novel method that improves performance in sparse data scenarios, though it is incremental as it builds on existing neural network and particle filter approaches.

The paper tackles sparse visual localization by introducing the Differentiable Mapping Network (DMN), which learns structured map representations from few observations, achieving effective localization in simulated and real-world Street View datasets with benefits scaling in larger environments and scarce training data.

Mapping and localization, preferably from a small number of observations, are fundamental tasks in robotics. We address these tasks by combining spatial structure (differentiable mapping) and end-to-end learning in a novel neural network architecture: the Differentiable Mapping Network (DMN). The DMN constructs a spatially structured view-embedding map and uses it for subsequent visual localization with a particle filter. Since the DMN architecture is end-to-end differentiable, we can jointly learn the map representation and localization using gradient descent. We apply the DMN to sparse visual localization, where a robot needs to localize in a new environment with respect to a small number of images from known viewpoints. We evaluate the DMN using simulated environments and a challenging real-world Street View dataset. We find that the DMN learns effective map representations for visual localization. The benefit of spatial structure increases with larger environments, more viewpoints for mapping, and when training data is scarce. Project website: http://sites.google.com/view/differentiable-mapping

View on arXiv PDF

Similar