LG SYJun 4, 2021

Robustifying Reinforcement Learning Policies with $\mathcal{L}_1$ Adaptive Control

Yikun Cheng, Pan Zhao, Manan Gandhi, Bo Li, Evangelos Theodorou, Naira Hovakimyan

arXiv:2106.02249v54.42 citations

Originality Incremental advance

AI Analysis

This addresses robustness issues for RL applications in dynamic environments, but it is incremental as it builds on existing control methods.

The paper tackles the problem of reinforcement learning policies failing in perturbed environments by proposing an approach using L1 adaptive control to robustify pre-trained policies, showing significant robustness improvements in numerical experiments.

A reinforcement learning (RL) policy trained in a nominal environment could fail in a new/perturbed environment due to the existence of dynamic variations. Existing robust methods try to obtain a fixed policy for all envisioned dynamic variation scenarios through robust or adversarial training. These methods could lead to conservative performance due to emphasis on the worst case, and often involve tedious modifications to the training environment. We propose an approach to robustifying a pre-trained non-robust RL policy with $\mathcal{L}_1$ adaptive control. Leveraging the capability of an $\mathcal{L}_1$ control law in the fast estimation of and active compensation for dynamic variations, our approach can significantly improve the robustness of an RL policy trained in a standard (i.e., non-robust) way, either in a simulator or in the real world. Numerical experiments are provided to validate the efficacy of the proposed approach.

View on arXiv PDF

Similar