CVAILGJul 25, 2025

SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence

arXiv:2507.19321v12 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses the need for more transparent and interpretable AI models in high-stakes domains like medical imaging and autonomous driving, representing an incremental improvement over existing methods.

The paper tackles the problem of complex and hard-to-understand explanations in prototypical-parts-based neural networks for explainable AI by introducing SIDE, a method that reduces explanation size by over 90% while maintaining accuracy.

Understanding the decisions made by deep neural networks is essential in high-stakes domains such as medical imaging and autonomous driving. Yet, these models often lack transparency, particularly in computer vision. Prototypical-parts-based neural networks have emerged as a promising solution by offering concept-level explanations. However, most are limited to fine-grained classification tasks, with few exceptions such as InfoDisent. InfoDisent extends prototypical models to large-scale datasets like ImageNet, but produces complex explanations. We introduce Sparse Information Disentanglement for Explainability (SIDE), a novel method that improves the interpretability of prototypical parts through a dedicated training and pruning scheme that enforces sparsity. Combined with sigmoid activations in place of softmax, this approach allows SIDE to associate each class with only a small set of relevant prototypes. Extensive experiments show that SIDE matches the accuracy of existing methods while reducing explanation size by over $90\%$, substantially enhancing the understandability of prototype-based explanations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes