LGROMLFeb 26, 2020

Learning Navigation Costs from Demonstration in Partially Observable Environments

arXiv:2002.11637v14 citations
AI Analysis

This work addresses safe and efficient autonomous navigation for robots in unknown, partially observable settings, representing an incremental advance in inverse reinforcement learning methods.

The paper tackles the problem of learning navigation costs from expert demonstrations in partially observable environments, achieving higher accuracy than baseline inverse reinforcement learning algorithms and significantly improving training and inference efficiency.

This paper focuses on inverse reinforcement learning (IRL) to enable safe and efficient autonomous navigation in unknown partially observable environments. The objective is to infer a cost function that explains expert-demonstrated navigation behavior while relying only on the observations and state-control trajectory used by the expert. We develop a cost function representation composed of two parts: a probabilistic occupancy encoder, with recurrent dependence on the observation sequence, and a cost encoder, defined over the occupancy features. The representation parameters are optimized by differentiating the error between demonstrated controls and a control policy computed from the cost encoder. Such differentiation is typically computed by dynamic programming through the value function over the whole state space. We observe that this is inefficient in large partially observable environments because most states are unexplored. Instead, we rely on a closed-form subgradient of the cost-to-go obtained only over a subset of promising states via an efficient motion-planning algorithm such as A* or RRT. Our experiments show that our model exceeds the accuracy of baseline IRL algorithms in robot navigation tasks, while substantially improving the efficiency of training and test-time inference.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes