Multi-Agent SAC Enabled Beamforming Design for Joint Secret Key Generation and Data Transmission
This work addresses security and efficiency issues in wireless communication for applications requiring quantum-resistant key distribution, representing an incremental improvement with a novel method for a known bottleneck.
The paper tackles the challenge of simultaneously achieving high data transmission and secret key generation rates under eavesdropping attacks in time-division duplex systems, proposing a multi-agent soft actor-critic framework that outperforms benchmarks and balances this trade-off effectively.
Physical layer key generation (PLKG) has emerged as a promising solution for achieving highly secured and low-latency key distribution, offering information-theoretic security that is inherently resilient to quantum attacks. However, simultaneously ensuring a high data transmission rate and a high secret key generation rate under eavesdropping attacks remains a major challenge. In time-division duplex (TDD) systems with multiple antennas, we derive closed-form expressions for both rates by modeling the legitimate channel as a time-correlated autoregressive (AR) process. This formulation leads to a highly nonconvex and time-coupled optimization problem, rendering traditional optimization methods ineffective. To address this issue, we propose a multi-agent soft actor-critic (SAC) framework equipped with a long short-term memory (LSTM) adversary prediction module to cope with the partial observability of the eavesdropper's mode. Simulation results demonstrate that the proposed approach achieves superior performance compared with other benchmark algorithms, while effectively balancing the trade-off between secret key generation rate and data transmission rate. The results also confirm the robustness of the proposed framework against intelligent eavesdropping and partial observation uncertainty.