LGAIFeb 26, 2023

Q-Cogni: An Integrated Causal Reinforcement Learning Framework

arXiv:2302.13240v13 citationsh-index: 154
Originality Incremental advance
AI Analysis

This addresses the need for more interpretable and efficient reinforcement learning in domains like vehicle routing, though it appears incremental as it redesigns Q-Learning with causal methods.

The paper tackles the problem of improving reinforcement learning by integrating causal inference, resulting in better policies, improved learning efficiency, and superior interpretability, with real-world application showing 85% of cases achieving equal or better policies.

We present Q-Cogni, an algorithmically integrated causal reinforcement learning framework that redesigns Q-Learning with an autonomous causal structure discovery method to improve the learning process with causal inference. Q-Cogni achieves optimal learning with a pre-learned structural causal model of the environment that can be queried during the learning process to infer cause-and-effect relationships embedded in a state-action space. We leverage on the sample efficient techniques of reinforcement learning, enable reasoning about a broader set of policies and bring higher degrees of interpretability to decisions made by the reinforcement learning agent. We apply Q-Cogni on the Vehicle Routing Problem (VRP) and compare against state-of-the-art reinforcement learning algorithms. We report results that demonstrate better policies, improved learning efficiency and superior interpretability of the agent's decision making. We also compare this approach with traditional shortest-path search algorithms and demonstrate the benefits of our causal reinforcement learning framework to high dimensional problems. Finally, we apply Q-Cogni to derive optimal routing decisions for taxis in New York City using the Taxi & Limousine Commission trip record data and compare with shortest-path search, reporting results that show 85% of the cases with an equal or better policy derived from Q-Cogni in a real-world domain.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes