AI LGJan 19

SL-CBM: Enhancing Concept Bottleneck Models with Semantic Locality for Better Interpretability

Hanwei Zhang, Luo Cheng, Rui Wen, Yang Zhang, Lijun Zhang, Holger Hermanns

arXiv:2601.12804v16.02 citations

Originality Incremental advance

AI Analysis

This work addresses interpretability issues in high-stakes domains for users of explainable AI, but it is incremental as it builds on existing CBMs with a novel extension.

The paper tackles the problem of poor locality faithfulness in Concept Bottleneck Models (CBMs), which limits their interpretability, by proposing SL-CBM to enforce spatially coherent saliency maps, resulting in substantial improvements in locality faithfulness, explanation quality, and intervention efficacy while maintaining competitive classification accuracy.

Explainable AI (XAI) is crucial for building transparent and trustworthy machine learning systems, especially in high-stakes domains. Concept Bottleneck Models (CBMs) have emerged as a promising ante-hoc approach that provides interpretable, concept-level explanations by explicitly modeling human-understandable concepts. However, existing CBMs often suffer from poor locality faithfulness, failing to spatially align concepts with meaningful image regions, which limits their interpretability and reliability. In this work, we propose SL-CBM (CBM with Semantic Locality), a novel extension that enforces locality faithfulness by generating spatially coherent saliency maps at both concept and class levels. SL-CBM integrates a 1x1 convolutional layer with a cross-attention mechanism to enhance alignment between concepts, image regions, and final predictions. Unlike prior methods, SL-CBM produces faithful saliency maps inherently tied to the model's internal reasoning, facilitating more effective debugging and intervention. Extensive experiments on image datasets demonstrate that SL-CBM substantially improves locality faithfulness, explanation quality, and intervention efficacy while maintaining competitive classification accuracy. Our ablation studies highlight the importance of contrastive and entropy-based regularization for balancing accuracy, sparsity, and faithfulness. Overall, SL-CBM bridges the gap between concept-based reasoning and spatial explainability, setting a new standard for interpretable and trustworthy concept-based models.

View on arXiv PDF

Similar