ROLGSYAug 26, 2020

Safe Active Dynamics Learning and Control: A Sequential Exploration-Exploitation Framework

arXiv:2008.11700v461 citations
AI Analysis

This addresses the challenge of ensuring safety for autonomous robots in uncertain environments, representing a strong specific gain in safe control.

The paper tackles the problem of safe autonomous robot deployment under dynamics uncertainty by proposing a Bayesian meta-learning framework with last-layer adaptation, which guarantees high-probability constraint satisfaction at all times through a sequential exploration-exploitation strategy.

Safe deployment of autonomous robots in diverse scenarios requires agents that are capable of efficiently adapting to new environments while satisfying constraints. In this work, we propose a practical and theoretically-justified approach to maintaining safety in the presence of dynamics uncertainty. Our approach leverages Bayesian meta-learning with last-layer adaptation. The expressiveness of neural-network features trained offline, paired with efficient last-layer online adaptation, enables the derivation of tight confidence sets which contract around the true dynamics as the model adapts online. We exploit these confidence sets to plan trajectories that guarantee the safety of the system. Our approach handles problems with high dynamics uncertainty, where reaching the goal safely is potentially initially infeasible, by first \textit{exploring} to gather data and reduce uncertainty, before autonomously \textit{exploiting} the acquired information to safely perform the task. Under reasonable assumptions, we prove that our framework guarantees the high-probability satisfaction of all constraints at all times jointly, i.e. over the total task duration. This theoretical analysis also motivates two regularizers of last-layer meta-learning models that improve online adaptation capabilities as well as performance by reducing the size of the confidence sets. We extensively demonstrate our approach in simulation and on hardware.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes