NI AI CRJul 23, 2025

LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks

Lijie Zheng, Ji He, Shih Yu Chang, Yulong Shen, Dusit Niyato

arXiv:2507.17188v13.33 citationsh-index: 19

Originality Incremental advance

AI Analysis

It addresses secure communication for UAV networks, an incremental advance by integrating LLM heuristics into reinforcement learning for trajectory optimization.

This work tackles the physical layer security problem of maximizing secrecy rate in heterogeneous UAV networks under energy constraints, proposing a hierarchical optimization framework that combines semidefinite relaxation and LLM-guided reinforcement learning, achieving improved secrecy rate and energy efficiency over baselines in simulations.

This work tackles the physical layer security (PLS) problem of maximizing the secrecy rate in heterogeneous UAV networks (HetUAVNs) under propulsion energy constraints. Unlike prior studies that assume uniform UAV capabilities or overlook energy-security trade-offs, we consider a realistic scenario where UAVs with diverse payloads and computation resources collaborate to serve ground terminals in the presence of eavesdroppers. To manage the complex coupling between UAV motion and communication, we propose a hierarchical optimization framework. The inner layer uses a semidefinite relaxation (SDR)-based S2DC algorithm combining penalty functions and difference-of-convex (d.c.) programming to solve the secrecy precoding problem with fixed UAV positions. The outer layer introduces a Large Language Model (LLM)-guided heuristic multi-agent reinforcement learning approach (LLM-HeMARL) for trajectory optimization. LLM-HeMARL efficiently incorporates expert heuristics policy generated by the LLM, enabling UAVs to learn energy-aware, security-driven trajectories without the inference overhead of real-time LLM calls. The simulation results show that our method outperforms existing baselines in secrecy rate and energy efficiency, with consistent robustness across varying UAV swarm sizes and random seeds.

View on arXiv PDF

Similar