CVAIMay 14

How to Evaluate and Refine your CAM

arXiv:2605.1464122.3
AI Analysis

For practitioners using CAMs for interpretability, this work provides better evaluation and refinement tools, though it is incremental in nature.

The authors introduce a synthetic dataset with ground-truth attributions to evaluate CAM metrics, propose ARCC as a more reliable metric, and develop RefineCAM for high-resolution attribution maps, which outperforms existing methods.

Class attribution maps (CAMs) provide local explanations for the decisions of convolutional neural networks. While widely used in practice, the evaluation of CAMs remains challenging due to the lack of ground-truth explanations, making it difficult to evaluate the soundness of existing metrics. Independently, most commonly used CAM methods produce low-resolution attribution maps, which limits their usefulness for detailed interpretability. To address the evaluation challenge, we introduce a synthetic dataset with ground-truth attributions that enables a rigorous comparison of CAM evaluation metrics. Using this dataset, we analyze existing metrics and propose ARCC, a new composite metric that more reliably identifies faithful explanations. To address the low resolution issue, we introduce RefineCAM, a method that produces high-resolution attribution maps by aggregating CAMs across multiple network layers. Our results show that RefineCAM consistently outperforms existing methods according to the proposed evaluation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes