AI · Jun 2, 2021

Smooth Q-learning: Accelerate Convergence of Q-learning Using Similarity

arXiv:2106.01134v1
Originality: Incremental advance
AI Analysis

This is an incremental improvement for reinforcement learning practitioners seeking faster training convergence.

The paper tackles slow convergence in Q-learning by incorporating the similarity between states and actions. In its numerical examples, the method performs significantly better than classic Q-learning.

This paper proposes an improvement to Q-learning. Unlike classic Q-learning, the proposed method takes the similarity between different states and actions into account. During training, a new update mechanism is used in which the Q-values of similar state-action pairs are updated synchronously. The proposed method can be combined with both tabular Q-learning and deep Q-learning. Numerical examples show that the proposed method performs significantly better than classic Q-learning.
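The abstract does not give the exact update rule, so the following is only a minimal tabular sketch of the idea of updating similar state-action pairs together. The `similarity` function (a triangular kernel over integer states), the `bandwidth` parameter, and the choice to scale the learning rate by similarity are all assumptions made for illustration, not the paper's definitions.

```python
from collections import defaultdict


def similarity(pair_a, pair_b, bandwidth=2.0):
    """Hypothetical similarity in [0, 1] (an assumption, not from the paper).

    Pairs with different actions are treated as dissimilar; otherwise the
    score decays linearly with the distance between integer states.
    """
    (s1, a1), (s2, a2) = pair_a, pair_b
    if a1 != a2:
        return 0.0
    return max(0.0, 1.0 - abs(s1 - s2) / bandwidth)


def smooth_q_update(Q, state, action, reward, next_state,
                    states, actions, alpha=0.5, gamma=0.9, bandwidth=2.0):
    """One update that also moves similar pairs toward the same target.

    For the visited pair the similarity is 1, so its update is the
    classic Q-learning step; other pairs take a smaller, weighted step.
    """
    # The target is computed once, before any Q-value is changed.
    target = reward + gamma * max(Q[(next_state, a)] for a in actions)
    for s in states:
        for a in actions:
            w = similarity((state, action), (s, a), bandwidth)
            if w > 0.0:
                Q[(s, a)] += alpha * w * (target - Q[(s, a)])
    return target


# Usage: a five-state chain with two actions, all Q-values starting at 0.
Q = defaultdict(float)
smooth_q_update(Q, state=2, action=1, reward=1.0, next_state=3,
                states=range(5), actions=[0, 1])
print(Q[(2, 1)], Q[(1, 1)], Q[(0, 1)])
```

With a bandwidth of 2, only the visited pair and its immediate neighbours under the same action are changed. A deep Q-learning variant would need a different mechanism, which this sketch does not attempt to cover.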

