LGAIMay 7

Concept-Based Abductive and Contrastive Explanations for Behaviors of Vision Models

arXiv:2605.0664017.8
Predicted impact top 27% in LG · last 90 daysOriginality Highly original
AI Analysis

For researchers and practitioners needing interpretable AI, this work merges causal concept explanations with formal abductive reasoning to provide minimal, high-level explanations of model behavior.

The paper introduces concept-based abductive and contrastive explanations that identify minimal sets of high-level concepts causally relevant for model predictions, and presents algorithms to enumerate these explanations using concept erasure. The approach is evaluated on multiple models and datasets, demonstrating effectiveness in producing user-friendly explanations.

*Concept-based explanations* offer a promising approach for explaining the predictions of deep neural networks in terms of high-level, human-understandable concepts. However, existing methods either do not establish a causal connection between the concepts and model predictions or are limited in expressivity and only able to infer causal explanations involving single concepts. At the same time, the parallel line of work on *formal abductive and contrastive explanations* computes the minimal set of input features causally relevant for model outcomes but only considers low-level features such as pixels. Merging these two threads, in this work, we propose the notion of *concept-based abductive and contrastive explanations* that capture the minimal sets of high-level concepts causally relevant for model outcomes. We then present a family of algorithms that enumerate all minimal explanations while using *concept erasure* procedures to establish causal relationships. By appropriately aggregating such explanations, we are not only able to understand model predictions on individual images but also on collections of images where the model exhibits a user-specified, common *behavior*. We evaluate our approach on multiple models, datasets, and behaviors, and demonstrate its effectiveness in computing helpful, user-friendly explanations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes