LG AI MLOct 29, 2019

Weight of Evidence as a Basis for Human-Oriented Explanations

David Alvarez-Melis, Hal Daumé, Jennifer Wortman Vaughan, Hanna Wallach

arXiv:1910.13503v111.522 citations

Originality Incremental advance

AI Analysis

This work addresses the need for more human-oriented interpretability in machine learning, which is crucial for users who rely on explanations to trust and understand AI decisions, though it appears incremental as it builds on existing concepts.

The paper tackled the problem of aligning machine-generated explanations with human preferences by drawing on insights from philosophy and cognitive science, formalizing a framework using weight of evidence from information theory, and demonstrating its effectiveness in producing intuitive explanations in two applications.

Interpretability is an elusive but highly sought-after characteristic of modern machine learning methods. Recent work has focused on interpretability via $\textit{explanations}$, which justify individual model predictions. In this work, we take a step towards reconciling machine explanations with those that humans produce and prefer by taking inspiration from the study of explanation in philosophy, cognitive science, and the social sciences. We identify key aspects in which these human explanations differ from current machine explanations, distill them into a list of desiderata, and formalize them into a framework via the notion of $\textit{weight of evidence}$ from information theory. Finally, we instantiate this framework in two simple applications and show it produces intuitive and comprehensible explanations.

View on arXiv PDF

Similar