LGAIJun 21, 2023

Evaluating the overall sensitivity of saliency-based explanation methods

arXiv:2306.13682v1h-index: 51
Originality Synthesis-oriented
AI Analysis

This work addresses the need for rigorous, cross-domain evaluation of explanation methods in AI interpretability, though it is incremental as it builds on an existing test.

The authors tackled the problem of evaluating the faithfulness of saliency-based explanation methods for deep learning models by extending an existing model-agnostic test to formally assess sensitivity, and they demonstrated its application in comparing multiple methods for Convolutional Neural Networks.

We address the need to generate faithful explanations of "black box" Deep Learning models. Several tests have been proposed to determine aspects of faithfulness of explanation methods, but they lack cross-domain applicability and a rigorous methodology. Hence, we select an existing test that is model agnostic and is well-suited for comparing one aspect of faithfulness (i.e., sensitivity) of multiple explanation methods, and extend it by specifying formal thresh-olds and building criteria to determine the over-all sensitivity of the explanation method. We present examples of how multiple explanation methods for Convolutional Neural Networks can be compared using this extended methodology. Finally, we discuss the relationship between sensitivity and faithfulness and consider how the test can be adapted to assess different explanation methods in other domains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes