LGNov 15, 2020

CDT: Cascading Decision Trees for Explainable Reinforcement Learning

Zihan Ding, Pablo Hernandez-Leal, Gavin Weiguang Ding, Changjian Li, Ruitong Huang

arXiv:2011.07553v210.123 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the need for explainable AI in reinforcement learning, offering incremental improvements for researchers and practitioners in the field.

The authors tackled the problem of explaining reinforcement learning policies by proposing Cascading Decision Trees (CDTs), which improve performance and explainability over existing tree-based methods, achieving better results with more succinct models in both policy approximation and imitation learning scenarios.

Deep Reinforcement Learning (DRL) has recently achieved significant advances in various domains. However, explaining the policy of RL agents still remains an open problem due to several factors, one being the complexity of explaining neural networks decisions. Recently, a group of works have used decision-tree-based models to learn explainable policies. Soft decision trees (SDTs) and discretized differentiable decision trees (DDTs) have been demonstrated to achieve both good performance and share the benefit of having explainable policies. In this work, we further improve the results for tree-based explainable RL in both performance and explainability. Our proposal, Cascading Decision Trees (CDTs) apply representation learning on the decision path to allow richer expressivity. Empirical results show that in both situations, where CDTs are used as policy function approximators or as imitation learners to explain black-box policies, CDTs can achieve better performances with more succinct and explainable models than SDTs. As a second contribution our study reveals limitations of explaining black-box policies via imitation learning with tree-based explainable models, due to its inherent instability.

View on arXiv PDF Code

Similar