CVAILGOct 30, 2024

CNN Explainability with Multivector Tucker Saliency Maps for Self-Supervised Models

arXiv:2410.23072v1h-index: 14
Originality Incremental advance
AI Analysis

This work addresses the challenge of explainability in CNNs for researchers and practitioners, particularly for self-supervised models, but it is incremental as it builds on existing methods like EigenCAM.

The paper tackles the problem of interpreting CNN decisions, especially for self-supervised models, by introducing Tucker Saliency Map (TSM) and its multivariant MTSM, which improve saliency map quality and achieve competitive performance with label-dependent methods, enhancing explainability by approximately 50% over EigenCAM.

Interpreting the decisions of Convolutional Neural Networks (CNNs) is essential for understanding their behavior, yet explainability remains a significant challenge, particularly for self-supervised models. Most existing methods for generating saliency maps rely on ground truth labels, restricting their use to supervised tasks. EigenCAM is the only notable label-independent alternative, leveraging Singular Value Decomposition to generate saliency maps applicable across CNN models, but it does not fully exploit the tensorial structure of feature maps. In this work, we introduce the Tucker Saliency Map (TSM) method, which applies Tucker tensor decomposition to better capture the inherent structure of feature maps, producing more accurate singular vectors and values. These are used to generate high-fidelity saliency maps, effectively highlighting objects of interest in the input. We further extend EigenCAM and TSM into multivector variants -Multivec-EigenCAM and Multivector Tucker Saliency Maps (MTSM)- which utilize all singular vectors and values, further improving saliency map quality. Quantitative evaluations on supervised classification models demonstrate that TSM, Multivec-EigenCAM, and MTSM achieve competitive performance with label-dependent methods. Moreover, TSM enhances explainability by approximately 50% over EigenCAM for both supervised and self-supervised models. Multivec-EigenCAM and MTSM further advance state-of-the-art explainability performance on self-supervised models, with MTSM achieving the best results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes