LGAIDec 18, 2024

SHAP scores fail pervasively even when Lipschitz succeeds

arXiv:2412.13866v13 citationsh-index: 12
Originality Incremental advance
AI Analysis

This work highlights critical flaws in a popular XAI tool, potentially affecting researchers and practitioners relying on SHAP for model interpretability, though it is incremental in building on prior examples.

The paper demonstrates that SHAP scores, a widely used explainable AI method, are pervasively unsatisfactory across various machine learning models, including Boolean classifiers and regression models, even when those models respect Lipschitz continuity or are arbitrarily differentiable.

The ubiquitous use of Shapley values in eXplainable AI (XAI) has been triggered by the tool SHAP, and as a result are commonly referred to as SHAP scores. Recent work devised examples of machine learning (ML) classifiers for which the computed SHAP scores are thoroughly unsatisfactory, by allowing human decision-makers to be misled. Nevertheless, such examples could be perceived as somewhat artificial, since the selected classes must be interpreted as numeric. Furthermore, it was unclear how general were the issues identified with SHAP scores. This paper answers these criticisms. First, the paper shows that for Boolean classifiers there are arbitrarily many examples for which the SHAP scores must be deemed unsatisfactory. Second, the paper shows that the issues with SHAP scores are also observed in the case of regression models. In addition, the paper studies the class of regression models that respect Lipschitz continuity, a measure of a function's rate of change that finds important recent uses in ML, including model robustness. Concretely, the paper shows that the issues with SHAP scores occur even for regression models that respect Lipschitz continuity. Finally, the paper shows that the same issues are guaranteed to exist for arbitrarily differentiable regression models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes