IVCVLGNov 17, 2020

Use HiResCAM instead of Grad-CAM for faithful explanations of convolutional neural networks

arXiv:2011.08891v4183 citations
AI Analysis

This work improves the faithfulness of CNN explanations, which is crucial for developers building trustworthy models in sensitive applications.

This paper addresses a limitation in Grad-CAM, where it sometimes highlights irrelevant regions due to gradient averaging. They propose HiResCAM, a new method that guarantees to highlight only the regions the model actually used, leading to more faithful explanations compared to Grad-CAM on PASCAL VOC 2012.

Explanation methods facilitate the development of models that learn meaningful concepts and avoid exploiting spurious correlations. We illustrate a previously unrecognized limitation of the popular neural network explanation method Grad-CAM: as a side effect of the gradient averaging step, Grad-CAM sometimes highlights locations the model did not actually use. To solve this problem, we propose HiResCAM, a novel class-specific explanation method that is guaranteed to highlight only the locations the model used to make each prediction. We prove that HiResCAM is a generalization of CAM and explore the relationships between HiResCAM and other gradient-based explanation methods. Experiments on PASCAL VOC 2012, including crowd-sourced evaluations, illustrate that while HiResCAM's explanations faithfully reflect the model, Grad-CAM often expands the attention to create bigger and smoother visualizations. Overall, this work advances convolutional neural network explanation approaches and may aid in the development of trustworthy models for sensitive applications.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes