CVLGNov 13, 2020

One Explanation is Not Enough: Structured Attention Graphs for Image Classification

arXiv:2011.06733v448 citations
Originality Incremental advance
AI Analysis

This work addresses the need for more comprehensive interpretability tools in image classification, offering an incremental improvement over existing attention-based methods.

The paper tackles the problem of incomplete explanations from single attention maps in image classification by introducing structured attention graphs (SAGs) to represent multiple attention maps, and a user study shows users are more correct in answering counterfactual questions with SAGs compared to baselines.

Attention maps are a popular way of explaining the decisions of convolutional networks for image classification. Typically, for each image of interest, a single attention map is produced, which assigns weights to pixels based on their importance to the classification. A single attention map, however, provides an incomplete understanding since there are often many other maps that explain a classification equally well. In this paper, we introduce structured attention graphs (SAGs), which compactly represent sets of attention maps for an image by capturing how different combinations of image regions impact a classifier's confidence. We propose an approach to compute SAGs and a visualization for SAGs so that deeper insight can be gained into a classifier's decisions. We conduct a user study comparing the use of SAGs to traditional attention maps for answering counterfactual questions about image classifications. Our results show that the users are more correct when answering comparative counterfactual questions based on SAGs compared to the baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes