RO AI OCSep 15, 2024

Critic as Lyapunov function (CALF): a model-free, stability-ensuring agent

Pavel Osinenko, Grigory Yaremenko, Roman Zashchitin, Anton Bolychev, Sinan Ibrahim, Dmitrii Dobriborsci

arXiv:2409.09869v18.38 citationsh-index: 6

Originality Highly original

AI Analysis

This work addresses the challenge of combining classical control with reinforcement learning for real-time stabilization, offering a novel approach that is model-free and online, which could benefit robotics and control systems applications.

The paper tackles the problem of stabilizing dynamical systems in reinforcement learning by introducing a model-free agent called CALF, which ensures online stabilization and outperforms baseline methods like SARSA and a modified version, improving learning performance in a mobile robot simulator case study.

This work presents and showcases a novel reinforcement learning agent called Critic As Lyapunov Function (CALF) which is model-free and ensures online environment, in other words, dynamical system stabilization. Online means that in each learning episode, the said environment is stabilized. This, as demonstrated in a case study with a mobile robot simulator, greatly improves the overall learning performance. The base actor-critic scheme of CALF is analogous to SARSA. The latter did not show any success in reaching the target in our studies. However, a modified version thereof, called SARSA-m here, did succeed in some learning scenarios. Still, CALF greatly outperformed the said approach. CALF was also demonstrated to improve a nominal stabilizer provided to it. In summary, the presented agent may be considered a viable approach to fusing classical control with reinforcement learning. Its concurrent approaches are mostly either offline or model-based, like, for instance, those that fuse model-predictive control into the agent.

View on arXiv PDF

Similar