RO LG SYOct 23, 2022

Active Learning of Discrete-Time Dynamics for Uncertainty-Aware Model Predictive Control

Alessandro Saviolo, Jonathan Frey, Abhishek Rathod, Moritz Diehl, Giuseppe Loianno

arXiv:2210.12583v419.453 citationsh-index: 67

Originality Incremental advance

AI Analysis

This addresses the need for adaptive and uncertainty-aware model predictive control in robotics, particularly for systems like quadrotors operating in dynamic environments, though it is incremental as it builds on existing active learning and control methods.

The paper tackles the problem of accurately modeling nonlinear robotic system dynamics for control in varying conditions by presenting a self-supervised learning approach that combines offline and online learning, resulting in high sample efficiency and real-time adaptation, with experiments on a quadrotor showing significant outperformance over baselines.

Model-based control requires an accurate model of the system dynamics for precisely and safely controlling the robot in complex and dynamic environments. Moreover, in the presence of variations in the operating conditions, the model should be continuously refined to compensate for dynamics changes. In this paper, we present a self-supervised learning approach that actively models the dynamics of nonlinear robotic systems. We combine offline learning from past experience and online learning from current robot interaction with the unknown environment. These two ingredients enable a highly sample-efficient and adaptive learning process, capable of accurately inferring model dynamics in real-time even in operating regimes that greatly differ from the training distribution. Moreover, we design an uncertainty-aware model predictive controller that is heuristically conditioned to the aleatoric (data) uncertainty of the learned dynamics. This controller actively chooses the optimal control actions that (i) optimize the control performance and (ii) improve the efficiency of online learning sample collection. We demonstrate the effectiveness of our method through a series of challenging real-world experiments using a quadrotor system. Our approach showcases high resilience and generalization capabilities by consistently adapting to unseen flight conditions, while it significantly outperforms classical and adaptive control baselines.

View on arXiv PDF

Similar