LG MLMar 2, 2020

Robust Policy Search for Robot Navigation

Javier Garcia-Barcos, Ruben Martinez-Cantin

arXiv:2003.01000v23.32 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of expensive interactive learning for robots in uncertain settings, offering an incremental improvement over existing Bayesian optimization methods.

The authors tackled the problem of data-efficient policy search for robot navigation in uncertain environments by combining robust optimization and statistical robustness, achieving improved performance and convergence guarantees in benchmarks and robot tasks.

Complex robot navigation and control problems can be framed as policy search problems. However, interactive learning in uncertain environments can be expensive, requiring the use of data-efficient methods. Bayesian optimization is an efficient nonlinear optimization method where queries are carefully selected to gather information about the optimum location. This is achieved by a surrogate model, which encodes past information, and the acquisition function for query selection. Bayesian optimization can be very sensitive to uncertainty in the input data or prior assumptions. In this work, we incorporate both robust optimization and statistical robustness, showing that both types of robustness are synergistic. For robust optimization we use an improved version of unscented Bayesian optimization which provides safe and repeatable policies in the presence of policy uncertainty. We also provide new theoretical insights. For statistical robustness, we use an adaptive surrogate model and we introduce the Boltzmann selection as a stochastic acquisition method to have convergence guarantees and improved performance even with surrogate modeling errors. We present results in several optimization benchmarks and robot tasks.

View on arXiv PDF

Similar