RODec 3, 2021

Improving the Robustness of Reinforcement Learning Policies with $\mathcal{L}_{1}$ Adaptive Control

arXiv:2112.01953v712 citations
Originality Incremental advance
AI Analysis

This addresses robustness issues for reinforcement learning control policies in continuous state and action spaces, but it is incremental as it builds on existing L1 adaptive control methods.

The paper tackles the problem of reinforcement learning policies failing in new or perturbed environments due to dynamic variations by proposing an add-on approach that augments pre-trained policies with an L1 adaptive controller, and numerical and real-world experiments demonstrate its efficacy in robustifying policies trained with model-free and model-based methods.

A reinforcement learning (RL) control policy could fail in a new/perturbed environment that is different from the training environment, due to the presence of dynamic variations. For controlling systems with continuous state and action spaces, we propose an add-on approach to robustifying a pre-trained RL policy by augmenting it with an $\mathcal{L}_{1}$ adaptive controller ($\mathcal{L}_{1}$AC). Leveraging the capability of an $\mathcal{L}_{1}$AC for fast estimation and active compensation of dynamic variations, the proposed approach can improve the robustness of an RL policy which is trained either in a simulator or in the real world without consideration of a broad class of dynamic variations. Numerical and real-world experiments empirically demonstrate the efficacy of the proposed approach in robustifying RL policies trained using both model-free and model-based methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes