RO LG MA SYSep 19, 2023

Learning Adaptive Safety for Multi-Agent Systems

Luigi Berducci, Shuo Yang, Rahul Mangharam, Radu Grosu

arXiv:2309.10657v27.412 citationsh-index: 43Has Code

Originality Incremental advance

AI Analysis

This work addresses safety assurance for multi-agent systems, which is crucial for applications like robotics and autonomous vehicles, but it is incremental as it builds on existing Control Barrier Function methods.

The paper tackles the challenge of ensuring safety in dynamic multi-agent systems by introducing ASRL, an adaptive safe reinforcement learning framework that automates the optimization of policies and Control Barrier Function coefficients, reducing cost violations below desired limits in multi-robot and racing scenarios.

Ensuring safety in dynamic multi-agent systems is challenging due to limited information about the other agents. Control Barrier Functions (CBFs) are showing promise for safety assurance but current methods make strong assumptions about other agents and often rely on manual tuning to balance safety, feasibility, and performance. In this work, we delve into the problem of adaptive safe learning for multi-agent systems with CBF. We show how emergent behavior can be profoundly influenced by the CBF configuration, highlighting the necessity for a responsive and dynamic approach to CBF design. We present ASRL, a novel adaptive safe RL framework, to fully automate the optimization of policy and CBF coefficients, to enhance safety and long-term performance through reinforcement learning. By directly interacting with the other agents, ASRL learns to cope with diverse agent behaviours and maintains the cost violations below a desired limit. We evaluate ASRL in a multi-robot system and a competitive multi-agent racing scenario, against learning-based and control-theoretic approaches. We empirically demonstrate the efficacy and flexibility of ASRL, and assess generalization and scalability to out-of-distribution scenarios. Code and supplementary material are public online.

View on arXiv PDF Code

Similar