LGAIMLOct 29, 2019

Weight of Evidence as a Basis for Human-Oriented Explanations

arXiv:1910.13503v122 citations
Originality Incremental advance
AI Analysis

This work addresses the need for more human-oriented interpretability in machine learning, which is crucial for users who rely on explanations to trust and understand AI decisions, though it appears incremental as it builds on existing concepts.

The paper tackled the problem of aligning machine-generated explanations with human preferences by drawing on insights from philosophy and cognitive science, formalizing a framework using weight of evidence from information theory, and demonstrating its effectiveness in producing intuitive explanations in two applications.

Interpretability is an elusive but highly sought-after characteristic of modern machine learning methods. Recent work has focused on interpretability via $\textit{explanations}$, which justify individual model predictions. In this work, we take a step towards reconciling machine explanations with those that humans produce and prefer by taking inspiration from the study of explanation in philosophy, cognitive science, and the social sciences. We identify key aspects in which these human explanations differ from current machine explanations, distill them into a list of desiderata, and formalize them into a framework via the notion of $\textit{weight of evidence}$ from information theory. Finally, we instantiate this framework in two simple applications and show it produces intuitive and comprehensible explanations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes