CVAILGMay 31, 2021

Bounded logit attention: Learning to explain image classifiers

arXiv:2105.14824v13 citations
Originality Incremental advance
AI Analysis

This addresses the need for scalable and variable-sized explanations in explainable AI for image classification, though it is incremental over prior methods like L2X.

The paper tackles the problem of explaining image classifiers by introducing bounded logit attention (BLA), a trainable module that selects subsets of convolutional features as explanations, which in a user study were preferred over Grad-CAM explanations.

Explainable artificial intelligence is the attempt to elucidate the workings of systems too complex to be directly accessible to human cognition through suitable side-information referred to as "explanations". We present a trainable explanation module for convolutional image classifiers we call bounded logit attention (BLA). The BLA module learns to select a subset of the convolutional feature map for each input instance, which then serves as an explanation for the classifier's prediction. BLA overcomes several limitations of the instancewise feature selection method "learning to explain" (L2X) introduced by Chen et al. (2018): 1) BLA scales to real-world sized image classification problems, and 2) BLA offers a canonical way to learn explanations of variable size. Due to its modularity BLA lends itself to transfer learning setups and can also be employed as a post-hoc add-on to trained classifiers. Beyond explainability, BLA may serve as a general purpose method for differentiable approximation of subset selection. In a user study we find that BLA explanations are preferred over explanations generated by the popular (Grad-)CAM method.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes