CV LGNov 13, 2020

One Explanation is Not Enough: Structured Attention Graphs for Image Classification

Vivswan Shitole, Li Fuxin, Minsuk Kahng, Prasad Tadepalli, Alan Fern

arXiv:2011.06733v415.748 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses the need for more comprehensive interpretability tools in image classification, offering an incremental improvement over existing attention-based methods.

The paper tackles the problem of incomplete explanations from single attention maps in image classification by introducing structured attention graphs (SAGs) to represent multiple attention maps, and a user study shows users are more correct in answering counterfactual questions with SAGs compared to baselines.

Attention maps are a popular way of explaining the decisions of convolutional networks for image classification. Typically, for each image of interest, a single attention map is produced, which assigns weights to pixels based on their importance to the classification. A single attention map, however, provides an incomplete understanding since there are often many other maps that explain a classification equally well. In this paper, we introduce structured attention graphs (SAGs), which compactly represent sets of attention maps for an image by capturing how different combinations of image regions impact a classifier's confidence. We propose an approach to compute SAGs and a visualization for SAGs so that deeper insight can be gained into a classifier's decisions. We conduct a user study comparing the use of SAGs to traditional attention maps for answering counterfactual questions about image classifications. Our results show that the users are more correct when answering comparative counterfactual questions based on SAGs compared to the baselines.

View on arXiv PDF Code

Similar