Asynchronous training of quantum reinforcement learning
This addresses a computational bottleneck for researchers exploring quantum reinforcement learning applications, though it appears incremental as it adapts existing asynchronous training techniques to the quantum domain.
The paper tackles the computational resource challenge in training quantum reinforcement learning agents using variational quantum circuits by proposing asynchronous training methods, demonstrating through numerical simulations that their approach achieves performance comparable to or superior to classical agents with similar model sizes and architectures.
The development of quantum machine learning (QML) has received a lot of interest recently thanks to developments in both quantum computing (QC) and machine learning (ML). One of the ML paradigms that can be utilized to address challenging sequential decision-making issues is reinforcement learning (RL). It has been demonstrated that classical RL can successfully complete many difficult tasks. A leading method of building quantum RL agents relies on the variational quantum circuits (VQC). However, training QRL algorithms with VQCs requires significant amount of computational resources. This issue hurdles the exploration of various QRL applications. In this paper, we approach this challenge through asynchronous training QRL agents. Specifically, we choose the asynchronous training of advantage actor-critic variational quantum policies. We demonstrate the results via numerical simulations that within the tasks considered, the asynchronous training of QRL agents can reach performance comparable to or superior than classical agents with similar model sizes and architectures.