CV AI LGMay 31, 2021

Bounded logit attention: Learning to explain image classifiers

Thomas Baumhauer, Djordje Slijepcevic, Matthias Zeppelzauer

arXiv:2105.14824v14.73 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the need for scalable and variable-sized explanations in explainable AI for image classification, though it is incremental over prior methods like L2X.

The paper tackles the problem of explaining image classifiers by introducing bounded logit attention (BLA), a trainable module that selects subsets of convolutional features as explanations, which in a user study were preferred over Grad-CAM explanations.

Explainable artificial intelligence is the attempt to elucidate the workings of systems too complex to be directly accessible to human cognition through suitable side-information referred to as "explanations". We present a trainable explanation module for convolutional image classifiers we call bounded logit attention (BLA). The BLA module learns to select a subset of the convolutional feature map for each input instance, which then serves as an explanation for the classifier's prediction. BLA overcomes several limitations of the instancewise feature selection method "learning to explain" (L2X) introduced by Chen et al. (2018): 1) BLA scales to real-world sized image classification problems, and 2) BLA offers a canonical way to learn explanations of variable size. Due to its modularity BLA lends itself to transfer learning setups and can also be employed as a post-hoc add-on to trained classifiers. Beyond explainability, BLA may serve as a general purpose method for differentiable approximation of subset selection. In a user study we find that BLA explanations are preferred over explanations generated by the popular (Grad-)CAM method.

View on arXiv PDF Code

Similar