MLLGMEMar 25, 2019

Explaining individual predictions when features are dependent: More accurate approximations to Shapley values

arXiv:1903.10464v3921 citations
Originality Incremental advance
AI Analysis

This addresses the issue of providing reliable explanations for individual predictions in machine learning models, particularly when features are not independent, which is an incremental improvement over existing methods.

The paper tackles the problem of inaccurate explanations from Shapley value approximations when features are dependent, and extends Kernel SHAP to handle feature dependence, resulting in more accurate approximations to true Shapley values as demonstrated with linear and non-linear models.

Explaining complex or seemingly simple machine learning models is an important practical problem. We want to explain individual predictions from a complex machine learning model by learning simple, interpretable explanations. Shapley values is a game theoretic concept that can be used for this purpose. The Shapley value framework has a series of desirable theoretical properties, and can in principle handle any predictive model. Kernel SHAP is a computationally efficient approximation to Shapley values in higher dimensions. Like several other existing methods, this approach assumes that the features are independent, which may give very wrong explanations. This is the case even if a simple linear model is used for predictions. In this paper, we extend the Kernel SHAP method to handle dependent features. We provide several examples of linear and non-linear models with various degrees of feature dependence, where our method gives more accurate approximations to the true Shapley values. We also propose a method for aggregating individual Shapley values, such that the prediction can be explained by groups of dependent variables.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes