LGNov 15, 2020

CDT: Cascading Decision Trees for Explainable Reinforcement Learning

arXiv:2011.07553v223 citations
AI Analysis

This work addresses the need for explainable AI in reinforcement learning, offering incremental improvements for researchers and practitioners in the field.

The authors tackled the problem of explaining reinforcement learning policies by proposing Cascading Decision Trees (CDTs), which improve performance and explainability over existing tree-based methods, achieving better results with more succinct models in both policy approximation and imitation learning scenarios.

Deep Reinforcement Learning (DRL) has recently achieved significant advances in various domains. However, explaining the policy of RL agents still remains an open problem due to several factors, one being the complexity of explaining neural networks decisions. Recently, a group of works have used decision-tree-based models to learn explainable policies. Soft decision trees (SDTs) and discretized differentiable decision trees (DDTs) have been demonstrated to achieve both good performance and share the benefit of having explainable policies. In this work, we further improve the results for tree-based explainable RL in both performance and explainability. Our proposal, Cascading Decision Trees (CDTs) apply representation learning on the decision path to allow richer expressivity. Empirical results show that in both situations, where CDTs are used as policy function approximators or as imitation learners to explain black-box policies, CDTs can achieve better performances with more succinct and explainable models than SDTs. As a second contribution our study reveals limitations of explaining black-box policies via imitation learning with tree-based explainable models, due to its inherent instability.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes