CVLGJul 16, 2024

Benchmarking the Attribution Quality of Vision Models

arXiv:2407.11910v28 citationsh-index: 10
Originality Incremental advance
AI Analysis

This work provides a more robust benchmarking framework for researchers and practitioners in explainable AI, though it is incremental as it builds on existing evaluation methods.

The paper tackles the challenge of evaluating attribution methods for vision models by proposing a novel evaluation protocol that addresses limitations in existing protocols, enabling the assessment of 23 methods and revealing that intrinsically explainable models outperform standard ones with higher attribution quality than previously known.

Attribution maps are one of the most established tools to explain the functioning of computer vision models. They assign importance scores to input features, indicating how relevant each feature is for the prediction of a deep neural network. While much research has gone into proposing new attribution methods, their proper evaluation remains a difficult challenge. In this work, we propose a novel evaluation protocol that overcomes two fundamental limitations of the widely used incremental-deletion protocol, i.e., the out-of-domain issue and lacking inter-model comparisons. This allows us to evaluate 23 attribution methods and how different design choices of popular vision backbones affect their attribution quality. We find that intrinsically explainable models outperform standard models and that raw attribution values exhibit a higher attribution quality than what is known from previous work. Further, we show consistent changes in the attribution quality when varying the network design, indicating that some standard design choices promote attribution quality.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes