ROMar 31, 2020

Enabling Topological Planning with Monocular Vision

arXiv:2003.14368v16 citations
AI Analysis

This work addresses the problem of computationally efficient navigation for robots in complex environments, though it appears incremental as it builds on existing topological and monocular SLAM methods.

The paper tackled the challenge of enabling topological planning with monocular vision in low-texture or cluttered environments by proposing a robust sparse map representation built from streaming images, which was shown to be sufficient for planning and exploration in simulated multi-agent search and learned subgoal applications.

Topological strategies for navigation meaningfully reduce the space of possible actions available to a robot, allowing use of heuristic priors or learning to enable computationally efficient, intelligent planning. The challenges in estimating structure with monocular SLAM in low texture or highly cluttered environments have precluded its use for topological planning in the past. We propose a robust sparse map representation that can be built with monocular vision and overcomes these shortcomings. Using a learned sensor, we estimate high-level structure of an environment from streaming images by detecting sparse vertices (e.g., boundaries of walls) and reasoning about the structure between them. We also estimate the known free space in our map, a necessary feature for planning through previously unknown environments. We show that our mapping technique can be used on real data and is sufficient for planning and exploration in simulated multi-agent search and learned subgoal planning applications.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes