LGAIJan 21, 2024

Graph Edits for Counterfactual Explanations: A comparative study

arXiv:2401.11609v32 citationsxAI
Originality Incremental advance
AI Analysis

This work addresses the need for efficient and effective explainability techniques in AI, particularly for image classification, but it is incremental as it builds on prior graph-based methods.

The study compared supervised and unsupervised Graph Neural Network approaches to generate minimal and meaningful counterfactual explanations for black-box image classifiers, finding that unsupervised methods were more time-efficient while supervised ones offered better performance in terms of edit minimality.

Counterfactuals have been established as a popular explainability technique which leverages a set of minimal edits to alter the prediction of a classifier. When considering conceptual counterfactuals on images, the edits requested should correspond to salient concepts present in the input data. At the same time, conceptual distances are defined by knowledge graphs, ensuring the optimality of conceptual edits. In this work, we extend previous endeavors on graph edits as counterfactual explanations by conducting a comparative study which encompasses both supervised and unsupervised Graph Neural Network (GNN) approaches. To this end, we pose the following significant research question: should we represent input data as graphs, which is the optimal GNN approach in terms of performance and time efficiency to generate minimal and meaningful counterfactual explanations for black-box image classifiers?

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes