ITAISep 7, 2024

Causality-Driven Reinforcement Learning for Joint Communication and Sensing

arXiv:2409.15329v12 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in 6G wireless networks for applications like autonomous driving, offering an incremental improvement in beamforming efficiency.

The paper tackles the inefficiency of reinforcement learning in massive MIMO-based joint communication and sensing systems by proposing a causality-driven RL agent that discovers causal relationships during training, resulting in improved beamforming gain over baseline methods.

The next-generation wireless network, 6G and beyond, envisions to integrate communication and sensing to overcome interference, improve spectrum efficiency, and reduce hardware and power consumption. Massive Multiple-Input Multiple Output (mMIMO)-based Joint Communication and Sensing (JCAS) systems realize this integration for 6G applications such as autonomous driving, as it requires accurate environmental sensing and time-critical communication with neighboring vehicles. Reinforcement Learning (RL) is used for mMIMO antenna beamforming in the existing literature. However, the huge search space for actions associated with antenna beamforming causes the learning process for the RL agent to be inefficient due to high beam training overhead. The learning process does not consider the causal relationship between action space and the reward, and gives all actions equal importance. In this work, we explore a causally-aware RL agent which can intervene and discover causal relationships for mMIMO-based JCAS environments, during the training phase. We use a state dependent action dimension selection strategy to realize causal discovery for RL-based JCAS. Evaluation of the causally-aware RL framework in different JCAS scenarios shows the benefit of our proposed framework over baseline methods in terms of the beamforming gain.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes