LGCRROFeb 25, 2025

ARBoids: Adaptive Residual Reinforcement Learning With Boids Model for Cooperative Multi-USV Target Defense

arXiv:2502.18549v2h-index: 4IEEE Robot Autom Lett
AI Analysis

This addresses a challenging cooperative multi-agent interception problem in robotics, with incremental improvements in adaptability and robustness for USV defense scenarios.

The paper tackles the target defense problem for unmanned surface vehicles (USVs) where attackers have superior maneuverability, by introducing ARBoids, an adaptive residual reinforcement learning framework that integrates deep reinforcement learning with the Boids model, demonstrating superior performance over traditional strategies in simulations.

The target defense problem (TDP) for unmanned surface vehicles (USVs) concerns intercepting an adversarial USV before it breaches a designated target region, using one or more defending USVs. A particularly challenging scenario arises when the attacker exhibits superior maneuverability compared to the defenders, significantly complicating effective interception. To tackle this challenge, this letter introduces ARBoids, a novel adaptive residual reinforcement learning framework that integrates deep reinforcement learning (DRL) with the biologically inspired, force-based Boids model. Within this framework, the Boids model serves as a computationally efficient baseline policy for multi-agent coordination, while DRL learns a residual policy to adaptively refine and optimize the defenders' actions. The proposed approach is validated in a high-fidelity Gazebo simulation environment, demonstrating superior performance over traditional interception strategies, including pure force-based approaches and vanilla DRL policies. Furthermore, the learned policy exhibits strong adaptability to attackers with diverse maneuverability profiles, highlighting its robustness and generalization capability. The code of ARBoids will be released upon acceptance of this letter.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes