LGDSOCMay 13, 2021

Online Algorithms and Policies Using Adaptive and Machine Learning Approaches

arXiv:2105.06577v744 citations
Originality Incremental advance
AI Analysis

This work addresses real-time control under parametric uncertainties for dynamic systems like robotics, offering a hybrid method that is incremental in combining existing RL and AC techniques.

The paper tackles real-time control in uncertain dynamic systems by combining Reinforcement Learning (RL) for optimal nominal policies and Adaptive Control (AC) for stability, proposing AC-RL controllers for two classes of nonlinear systems and demonstrating stability and parameter learning. Numerical validation on a quadrotor landing task shows the approach accommodates uncertainties and input limits.

This paper considers the problem of real-time control and learning in dynamic systems subjected to parametric uncertainties. We propose a combination of a Reinforcement Learning (RL) based policy in the outer loop suitably chosen to ensure stability and optimality for the nominal dynamics, together with Adaptive Control (AC) in the inner loop so that in real-time AC contracts the closed-loop dynamics towards a stable trajectory traced out by RL. Two classes of nonlinear dynamic systems are considered, both of which are control-affine. The first class of dynamic systems utilizes equilibrium points %with expansion forms around these points and a Lyapunov approach while second class of nonlinear systems uses contraction theory. AC-RL controllers are proposed for both classes of systems and shown to lead to online policies that guarantee stability using a high-order tuner and accommodate parametric uncertainties and magnitude limits on the input. In addition to establishing a stability guarantee with real-time control, the AC-RL controller is also shown to lead to parameter learning with persistent excitation for the first class of systems. Numerical validations of all algorithms are carried out using a quadrotor landing task on a moving platform.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes