ROAICVLGSYMar 20, 2020

Visual Navigation Among Humans with Optimal Control as a Supervisor

arXiv:2003.09354v247 citations
AI Analysis

This addresses the challenge of safe and efficient robot navigation in crowded human spaces, representing an incremental improvement by integrating existing methods in a novel way.

The paper tackles the problem of visual navigation for robots in dynamic, human-occupied environments by combining learning-based perception with model-based optimal control, using only monocular RGB images. It demonstrates that the learned policies can anticipate and react to humans without explicit motion prediction, generalize to unseen environments and behaviors, and transfer directly from simulation to reality.

Real world visual navigation requires robots to operate in unfamiliar, human-occupied dynamic environments. Navigation around humans is especially difficult because it requires anticipating their future motion, which can be quite challenging. We propose an approach that combines learning-based perception with model-based optimal control to navigate among humans based only on monocular, first-person RGB images. Our approach is enabled by our novel data-generation tool, HumANav that allows for photorealistic renderings of indoor environment scenes with humans in them, which are then used to train the perception module entirely in simulation. Through simulations and experiments on a mobile robot, we demonstrate that the learned navigation policies can anticipate and react to humans without explicitly predicting future human motion, generalize to previously unseen environments and human behaviors, and transfer directly from simulation to reality. Videos describing our approach and experiments, as well as a demo of HumANav are available on the project website.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes