CVLGMar 10, 2025

Now you see me! Attribution Distributions Reveal What is Truly Important for a Prediction

arXiv:2503.07346v21 citationsh-index: 8Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for better transparency in high-stakes decision-making using neural networks, offering an incremental improvement to existing attribution methods.

The authors tackled the problem of unspecific saliency maps in neural network attribution methods by proposing to compute probability distributions of attributions over classes for each spatial location, which improved object- and instance-specificity and uncovered discriminative and shared features. They demonstrated improvements on benchmarks like the grid-pointing game and randomization-based sanity checks, with the method being architecture-agnostic.

Neural networks are regularly employed in high-stakes decision-making, where understanding and transparency is key. Attribution methods have been developed to gain understanding into which input features neural networks use for a specific prediction. Although widely used in computer vision, these methods often result in unspecific saliency maps that fail to identify the relevant information that led to a decision, supported by different benchmarks results. Here, we revisit the common attribution pipeline and identify one cause for the lack of specificity in attributions as the computation of attribution of isolated logits. Instead, we suggest to combine attributions of multiple class logits in analogy to how the softmax combines the information across logits. By computing probability distributions of attributions over classes for each spatial location in the image, we unleash the true capabilities of existing attribution methods, revealing better object- and instance-specificity and uncovering discriminative as well as shared features between classes. On common benchmarks, including the grid-pointing game and randomization-based sanity checks, we show that this reconsideration of how and where we compute attributions across the network improves established attribution methods while staying agnostic to model architectures. We make the code publicly available: https://github.com/nilspwalter/var.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes