AI LG | Apr 25, 2023

Federated Deep Reinforcement Learning for THz-Beam Search with Limited CSI

Po-Chun Hsu, Li-Hsiang Shen, Chun-Hung Liu, Kai-Ten Feng

arXiv:2304.13109v16.77 citations | h-index: 23

Originality: Incremental advance

AI Analysis

This work addresses a domain-specific problem for next-generation wireless networks, offering incremental improvements in beam search efficiency.

The paper tackles efficient beam-direction search for terahertz communication in cellular networks, where severe propagation attenuation must be overcome. It proposes a federated deep reinforcement learning approach that achieves higher throughput than conventional methods, and shows that partial model updates reduce the communication load between the edge server and the base stations.

Terahertz (THz) communication with ultra-wide available spectrum is a promising technique that can meet the stringent high-data-rate requirement of next-generation wireless networks, yet its severe propagation attenuation significantly hinders its implementation in practice. Finding beam directions for a large-scale antenna array that effectively overcome the severe propagation attenuation of THz signals is a pressing need. This paper proposes a novel federated deep reinforcement learning (FDRL) approach to swiftly perform THz-beam search for multiple base stations (BSs) coordinated by an edge server in a cellular network. All the BSs conduct deep deterministic policy gradient (DDPG)-based DRL to obtain a THz beamforming policy with limited channel state information (CSI). They update their DDPG models with hidden information in order to mitigate inter-cell interference. We demonstrate that the cellular network can achieve higher throughput as more THz CSI and hidden neurons of DDPG are adopted. We also show that FDRL with partial model update achieves nearly the same performance as FDRL with full model update, which indicates that partial model uploading is an effective means to reduce the communication load between the edge server and the BSs. Moreover, the proposed FDRL outperforms conventional non-learning-based and existing non-FDRL benchmark optimization methods.
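
The partial model update mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which parameters the BSs upload or how the edge server aggregates them, so everything here is an assumption made for illustration: the layer names (`actor_hidden`, `actor_out`), the flat-list weight layout, and the use of plain unweighted averaging. It is not the authors' algorithm, only one way such a scheme could look.

```python
from typing import Dict, List

# Hypothetical model layout: layer name -> flat list of weights.
Model = Dict[str, List[float]]


def federated_average(models: List[Model], shared_layers: List[str]) -> Model:
    """Edge-server step (assumed): average only the layers the BSs upload."""
    averaged: Model = {}
    for name in shared_layers:
        # Group the i-th weight of this layer across all BS models.
        columns = zip(*(m[name] for m in models))
        averaged[name] = [sum(col) / len(models) for col in columns]
    return averaged


def apply_update(model: Model, averaged: Model) -> Model:
    """BS step (assumed): overwrite shared layers, keep the rest local."""
    updated = {name: list(weights) for name, weights in model.items()}
    for name, weights in averaged.items():
        updated[name] = list(weights)
    return updated


# Two toy BS models; only "actor_hidden" is uploaded (partial update).
bs_models = [
    {"actor_hidden": [1.0, 2.0], "actor_out": [0.5]},
    {"actor_hidden": [3.0, 4.0], "actor_out": [1.5]},
]
global_part = federated_average(bs_models, ["actor_hidden"])
new_models = [apply_update(m, global_part) for m in bs_models]
```

In this sketch each BS uploads two of its three weights per round, which is where the saving in communication load would come from; the non-uploaded layer stays specific to each BS.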
