CVAIMMMar 8, 2024

Feature CAM: Interpretable AI in Image Classification

arXiv:2403.05658v14 citationsh-index: 2
Originality Incremental advance
AI Analysis

This addresses the lack of trust in AI for critical fields like security and health by providing more interpretable models, though it is incremental as it builds on existing activation-based methods.

The paper tackled the problem of interpretability in deep neural networks for image classification by introducing Feature CAM, a novel technique that produces saliency maps 3-4 times more human-interpretable than state-of-the-art activation-based methods while maintaining machine interpretability.

Deep Neural Networks have often been called the black box because of the complex, deep architecture and non-transparency presented by the inner layers. There is a lack of trust to use Artificial Intelligence in critical and high-precision fields such as security, finance, health, and manufacturing industries. A lot of focused work has been done to provide interpretable models, intending to deliver meaningful insights into the thoughts and behavior of neural networks. In our research, we compare the state-of-the-art methods in the Activation-based methods (ABM) for interpreting predictions of CNN models, specifically in the application of Image Classification. We then extend the same for eight CNN-based architectures to compare the differences in visualization and thus interpretability. We introduced a novel technique Feature CAM, which falls in the perturbation-activation combination, to create fine-grained, class-discriminative visualizations. The resulting saliency maps from our experiments proved to be 3-4 times better human interpretable than the state-of-the-art in ABM. At the same time it reserves machine interpretability, which is the average confidence scores in classification.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes