NI LGDec 5, 2024

Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach

Xiaowen Ye, Yuyi Mao, Xianghao Yu, Shu Sun, Liqun Fu, Jie Xu

arXiv:2412.04074v36.639 citationsh-index: 37IEEE Trans Wirel Commun

Originality Incremental advance

AI Analysis

It addresses the challenge of efficient resource allocation in low-altitude airspace for UAVs, which is an incremental improvement in domain-specific applications.

This paper tackles the problem of optimizing integrated sensing and communications for low-altitude economy by jointly optimizing beamforming and UAV trajectories to maximize communication sum-rate, using a deep reinforcement learning approach that achieves higher sum-rate, faster convergence, and improved robustness compared to benchmarks.

This paper studies an integrated sensing and communications (ISAC) system for low-altitude economy (LAE), where a ground base station (GBS) provides communication and navigation services for authorized unmanned aerial vehicles (UAVs), while sensing the low-altitude airspace to monitor the unauthorized mobile target. The expected communication sum-rate over a given flight period is maximized by jointly optimizing the beamforming at the GBS and UAVs' trajectories, subject to the constraints on the average signal-to-noise ratio requirement for sensing, the flight mission and collision avoidance of UAVs, as well as the maximum transmit power at the GBS. Typically, this is a sequential decision-making problem with the given flight mission. Thus, we transform it to a specific Markov decision process (MDP) model called episode task. Based on this modeling, we propose a novel LAE-oriented ISAC scheme, referred to as Deep LAE-ISAC (DeepLSC), by leveraging the deep reinforcement learning (DRL) technique. In DeepLSC, a reward function and a new action selection policy termed constrained noise-exploration policy are judiciously designed to fulfill various constraints. To enable efficient learning in episode tasks, we develop a hierarchical experience replay mechanism, where the gist is to employ all experiences generated within each episode to jointly train the neural network. Besides, to enhance the convergence speed of DeepLSC, a symmetric experience augmentation mechanism, which simultaneously permutes the indexes of all variables to enrich available experience sets, is proposed. Simulation results demonstrate that compared with benchmarks, DeepLSC yields a higher sum-rate while meeting the preset constraints, achieves faster convergence, and is more robust against different settings.

View on arXiv PDF

Similar