AIFeb 5, 2019

Explanation in Human-AI Systems: A Literature Meta-Review, Synopsis of Key Ideas and Publications, and Bibliography for Explainable AI

arXiv:1902.01876v1316 citations
Originality Synthesis-oriented
AI Analysis

It provides a meta-review for AI/XAI researchers to understand explanation concepts and improve research practices, but it is incremental as it compiles existing literature without new findings.

This integrative review synthesizes literature on what constitutes a good explanation in AI systems, covering historical efforts in intelligent tutoring and expert systems, modern explainability challenges, and key psychological theories, while recommending more detailed empirical methods in AI/XAI research reports.

This is an integrative review that address the question, "What makes for a good explanation?" with reference to AI systems. Pertinent literatures are vast. Thus, this review is necessarily selective. That said, most of the key concepts and issues are expressed in this Report. The Report encapsulates the history of computer science efforts to create systems that explain and instruct (intelligent tutoring systems and expert systems). The Report expresses the explainability issues and challenges in modern AI, and presents capsule views of the leading psychological theories of explanation. Certain articles stand out by virtue of their particular relevance to XAI, and their methods, results, and key points are highlighted. It is recommended that AI/XAI researchers be encouraged to include in their research reports fuller details on their empirical or experimental methods, in the fashion of experimental psychology research reports: details on Participants, Instructions, Procedures, Tasks, Dependent Variables (operational definitions of the measures and metrics), Independent Variables (conditions), and Control Conditions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes