IV CV LGNov 17, 2020

Use HiResCAM instead of Grad-CAM for faithful explanations of convolutional neural networks

arXiv:2011.08891v429.2188 citationsHas Code

Originality Incremental advance

AI Analysis

This work improves the faithfulness of CNN explanations, which is crucial for developers building trustworthy models in sensitive applications.

This paper addresses a limitation in Grad-CAM, where it sometimes highlights irrelevant regions due to gradient averaging. They propose HiResCAM, a new method that guarantees to highlight only the regions the model actually used, leading to more faithful explanations compared to Grad-CAM on PASCAL VOC 2012.

Explanation methods facilitate the development of models that learn meaningful concepts and avoid exploiting spurious correlations. We illustrate a previously unrecognized limitation of the popular neural network explanation method Grad-CAM: as a side effect of the gradient averaging step, Grad-CAM sometimes highlights locations the model did not actually use. To solve this problem, we propose HiResCAM, a novel class-specific explanation method that is guaranteed to highlight only the locations the model used to make each prediction. We prove that HiResCAM is a generalization of CAM and explore the relationships between HiResCAM and other gradient-based explanation methods. Experiments on PASCAL VOC 2012, including crowd-sourced evaluations, illustrate that while HiResCAM's explanations faithfully reflect the model, Grad-CAM often expands the attention to create bigger and smoother visualizations. Overall, this work advances convolutional neural network explanation approaches and may aid in the development of trustworthy models for sensitive applications.

View on arXiv PDF Code

Similar