LGNov 12, 2022

Deep Reinforcement Learning with Vector Quantized Encoding

Liang Zhang, Justin Lieffers, Adarsh Pyarelal

arXiv:2211.06733v14.63 citationsh-index: 9

Originality Incremental advance

AI Analysis

This addresses interpretability challenges in deep RL for researchers and practitioners, but it is incremental as it builds on existing RL pipelines with added clustering.

The authors tackled the problem of improving interpretability in deep reinforcement learning by clustering state features into categories, resulting in tighter and better-separated clusters compared to classic methods.

Human decision-making often involves combining similar states into categories and reasoning at the level of the categories rather than the actual states. Guided by this intuition, we propose a novel method for clustering state features in deep reinforcement learning (RL) methods to improve their interpretability. Specifically, we propose a plug-and-play framework termed \emph{vector quantized reinforcement learning} (VQ-RL) that extends classic RL pipelines with an auxiliary classification task based on vector quantized (VQ) encoding and aligns with policy training. The VQ encoding method categorizes features with similar semantics into clusters and results in tighter clusters with better separation compared to classic deep RL methods, thus enabling neural models to learn similarities and differences between states better. Furthermore, we introduce two regularization methods to help increase the separation between clusters and avoid the risks associated with VQ training. In simulations, we demonstrate that VQ-RL improves interpretability and investigate its impact on robustness and generalization of deep RL.

View on arXiv PDF

Similar