AILGMLMay 31, 2018

Explaining Explanations: An Overview of Interpretability of Machine Learning

arXiv:1806.00069v32202 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental survey paper that synthesizes existing work to highlight gaps and propose best practices for improving interpretability in AI, targeting researchers and practitioners in the field.

The paper addresses the lack of standardization and systematic assessment in explanatory AI (XAI), which is crucial for transparency, fairness, and bias detection in machine learning systems, by providing a definition of explainability and classifying existing literature to identify open challenges and future research directions.

There has recently been a surge of work in explanatory artificial intelligence (XAI). This research area tackles the important problem that complex machines and algorithms often cannot provide insights into their behavior and thought processes. XAI allows users and parts of the internal system to be more transparent, providing explanations of their decisions in some level of detail. These explanations are important to ensure algorithmic fairness, identify potential bias/problems in the training data, and to ensure that the algorithms perform as expected. However, explanations produced by these systems is neither standardized nor systematically assessed. In an effort to create best practices and identify open challenges, we provide our definition of explainability and show how it can be used to classify existing literature. We discuss why current approaches to explanatory methods especially for deep neural networks are insufficient. Finally, based on our survey, we conclude with suggested future research directions for explanatory artificial intelligence.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes