LGOct 12, 2020

Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum Sharing for 5G and Beyond

arXiv:2010.05449v18.555 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of efficient spectrum management for 5G and beyond networks, offering a more adaptable DRL approach, though it is incremental as it builds on existing DRL and RNN techniques.

The paper tackles the challenge of applying deep reinforcement learning (DRL) in highly dynamic environments with limited training data, such as 5G cellular communications, by introducing the Deep Echo State Q-Network (DEQN) method. It demonstrates DEQN's effectiveness in dynamic spectrum sharing scenarios, showing it can adapt quickly with reduced training overhead compared to conventional methods.

Deep reinforcement learning (DRL) has been shown to be successful in many application domains. Combining recurrent neural networks (RNNs) and DRL further enables DRL to be applicable in non-Markovian environments by capturing temporal information. However, training of both DRL and RNNs is known to be challenging requiring a large amount of training data to achieve convergence. In many targeted applications, such as those used in the fifth generation (5G) cellular communication, the environment is highly dynamic while the available training data is very limited. Therefore, it is extremely important to develop DRL strategies that are capable of capturing the temporal correlation of the dynamic environment requiring limited training overhead. In this paper, we introduce the deep echo state Q-network (DEQN) that can adapt to the highly dynamic environment in a short period of time with limited training data. We evaluate the performance of the introduced DEQN method under the dynamic spectrum sharing (DSS) scenario, which is a promising technology in 5G and future 6G networks to increase the spectrum utilization. Compared to conventional spectrum management policy that grants a fixed spectrum band to a single system for exclusive access, DSS allows the secondary system to share the spectrum with the primary system. Our work sheds light on the application of an efficient DRL framework in highly dynamic environments with limited available training data.

View on arXiv PDF

Similar