NI LGJan 29

Securing SIM-Assisted Wireless Networks via Quantum Reinforcement Learning

Le-Hung Hoang, Quang-Trung Luu, Dinh Thai Hoang, Diep N. Nguyen, Van-Dinh Nguyen

arXiv:2602.13238v11.2h-index: 53

Originality Highly original

AI Analysis

This work addresses the problem of securing wireless communications for network operators by introducing a novel quantum-enhanced method, though it is incremental as it builds on existing reinforcement learning and quantum computing techniques.

The paper tackled the challenge of optimizing physical-layer security in SIM-assisted wireless networks, which involves high-dimensional and dynamic optimization, by proposing a hybrid quantum reinforcement learning framework that achieved approximately 15% higher secrecy rates and 30% faster convergence compared to deep reinforcement learning baselines.

Stacked intelligent metasurfaces (SIMs) have recently emerged as a powerful wave-domain technology that enables multi-stage manipulation of electromagnetic signals through multilayer programmable architectures. While SIMs offer unprecedented degrees of freedom for enhancing physical-layer security, their extremely large number of meta-atoms leads to a high-dimensional and strongly coupled optimization space, making conventional design approaches inefficient and difficult to scale. Moreover, existing deep reinforcement learning (DRL) techniques suffer from slow convergence and performance degradation in dynamic wireless environments with imperfect knowledge of passive eavesdroppers. To overcome these challenges, we propose a hybrid quantum proximal policy optimization (Q-PPO) framework for SIM-assisted secure communications, which jointly optimizes transmit power allocation and SIM phase shifts to maximize the average secrecy rate under power and quality-of-service constraints. Specifically, a parameterized quantum circuit is embedded into the actor network, forming a hybrid classical-quantum policy architecture that enhances policy representation capability and exploration efficiency in high-dimensional continuous action spaces. Extensive simulations demonstrate that the proposed Q-PPO scheme consistently outperforms DRL baselines, achieving approximately 15% higher secrecy rates and 30% faster convergence under imperfect eavesdropper channel state information. These results establish Q-PPO as a powerful optimization paradigm for SIM-enabled secure wireless networks.

View on arXiv PDF

Similar