LGAIMar 4, 2022

Benchmarking Instance-Centric Counterfactual Algorithms for XAI: From White Box to Black Box

arXiv:2203.02399v413 citationsh-index: 18
Originality Synthesis-oriented
AI Analysis

It addresses the reliability of counterfactual explanations in XAI for practitioners, highlighting incremental insights into algorithm limitations.

This study benchmarked counterfactual explanation algorithms across different model types and datasets, finding that model type has little impact, algorithms based solely on proximity are not actionable, and plausibility is essential for meaningful evaluation.

This study investigates the impact of machine learning models on the generation of counterfactual explanations by conducting a benchmark evaluation over three different types of models: a decision tree (fully transparent, interpretable, white-box model), a random forest (semi-interpretable, grey-box model), and a neural network (fully opaque, black-box model). We tested the counterfactual generation process using four algorithms (DiCE, WatcherCF, prototype, and GrowingSpheresCF) in the literature in 25 different datasets. Our findings indicate that: (1) Different machine learning models have little impact on the generation of counterfactual explanations; (2) Counterfactual algorithms based uniquely on proximity loss functions are not actionable and will not provide meaningful explanations; (3) One cannot have meaningful evaluation results without guaranteeing plausibility in the counterfactual generation. Algorithms that do not consider plausibility in their internal mechanisms will lead to biased and unreliable conclusions if evaluated with the current state-of-the-art metrics; (4) A counterfactual inspection analysis is strongly recommended to ensure a robust examination of counterfactual explanations and the potential identification of biases.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes