MLLGMar 4, 2022

Interpretable Off-Policy Learning via Hyperbox Search

arXiv:2203.02473v27 citationsh-index: 41
AI Analysis

It addresses the need for interpretable decision-making in clinical practice, offering a domain-specific incremental improvement over existing methods.

The paper tackles the problem of learning interpretable personalized treatment policies from observational data, proposing an algorithm via hyperbox search that outperforms state-of-the-art methods in terms of regret and is rated as highly interpretable by clinical experts.

Personalized treatment decisions have become an integral part of modern medicine. Thereby, the aim is to make treatment decisions based on individual patient characteristics. Numerous methods have been developed for learning such policies from observational data that achieve the best outcome across a certain policy class. Yet these methods are rarely interpretable. However, interpretability is often a prerequisite for policy learning in clinical practice. In this paper, we propose an algorithm for interpretable off-policy learning via hyperbox search. In particular, our policies can be represented in disjunctive normal form (i.e., OR-of-ANDs) and are thus intelligible. We prove a universal approximation theorem that shows that our policy class is flexible enough to approximate any measurable function arbitrarily well. For optimization, we develop a tailored column generation procedure within a branch-and-bound framework. Using a simulation study, we demonstrate that our algorithm outperforms state-of-the-art methods from interpretable off-policy learning in terms of regret. Using real-word clinical data, we perform a user study with actual clinical experts, who rate our policies as highly interpretable.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes