AICYLGJun 26, 2018

Open the Black Box Data-Driven Explanation of Black Box Decision Systems

arXiv:1806.09936v139 citations
Originality Highly original
AI Analysis

This addresses transparency and bias issues in automated decision-making systems, which is crucial for fairness and accountability in applications like finance or hiring.

The paper tackles the problem of explaining black-box decision systems by introducing a local-to-global framework that uses logic-based rules to provide interpretable explanations, with promising early results.

Black box systems for automated decision making, often based on machine learning over (big) data, map a user's features into a class or a score without exposing the reasons why. This is problematic not only for lack of transparency, but also for possible biases hidden in the algorithms, due to human prejudices and collection artifacts hidden in the training data, which may lead to unfair or wrong decisions. We introduce the local-to-global framework for black box explanation, a novel approach with promising early results, which paves the road for a wide spectrum of future developments along three dimensions: (i) the language for expressing explanations in terms of highly expressive logic-based rules, with a statistical and causal interpretation; (ii) the inference of local explanations aimed at revealing the logic of the decision adopted for a specific instance by querying and auditing the black box in the vicinity of the target instance; (iii), the bottom-up generalization of the many local explanations into simple global ones, with algorithms that optimize the quality and comprehensibility of explanations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes