SY SYJan 20, 2025

Fast State Stabilization using Deep Reinforcement Learning for Measurement-based Quantum Feedback Control

Chunxiang Song, Yanan Liu, Daoyi Dong, Hidehiro Yonezawa

arXiv:2408.113281 citationsh-index: 29

AI Analysis

For quantum technology researchers, this provides a faster feedback control method to mitigate decoherence, but it is an incremental application of existing DRL to a known problem.

This paper uses deep reinforcement learning to stabilize quantum states faster than traditional methods like Lyapunov control, achieving convergence times up to 50% shorter in two-qubit and three-qubit simulations, while maintaining robustness against measurement imperfections and delays.

The stabilization of quantum states is a fundamental problem for realizing various quantum technologies. Measurement-based-feedback strategies have demonstrated powerful performance, and the construction of quantum control signals using measurement information has attracted great interest. However, the interaction between quantum systems and the environment is inevitable, especially when measurements are introduced, which leads to decoherence. To mitigate decoherence, it is desirable to stabilize quantum systems faster, thereby reducing the time of interaction with the environment. In this paper, we utilize information obtained from measurement and apply deep reinforcement learning (DRL) algorithms, without explicitly constructing specific complex measurement-control mappings, to rapidly drive random initial quantum state to the target state. The proposed DRL algorithm has the ability to speed up the convergence to a target state, which shortens the interaction between quantum systems and their environments to protect coherence. Simulations are performed on two-qubit and three-qubit systems, and the results show that our algorithm can successfully stabilize random initial quantum system to the target entangled state, with a convergence time faster than traditional methods such as Lyapunov feedback control and several DRL algorithms with different reward functions. Moreover, it exhibits robustness against imperfect measurements and delays in system evolution.

View on arXiv PDF

Similar