LG AIJan 21, 2024

Graph Edits for Counterfactual Explanations: A comparative study

Angeliki Dimitriou, Nikolaos Chaidos, Maria Lymperaiou, Giorgos Stamou

arXiv:2401.11609v34.62 citationsxAI

Originality Incremental advance

AI Analysis

This work addresses the need for efficient and effective explainability techniques in AI, particularly for image classification, but it is incremental as it builds on prior graph-based methods.

The study compared supervised and unsupervised Graph Neural Network approaches to generate minimal and meaningful counterfactual explanations for black-box image classifiers, finding that unsupervised methods were more time-efficient while supervised ones offered better performance in terms of edit minimality.

Counterfactuals have been established as a popular explainability technique which leverages a set of minimal edits to alter the prediction of a classifier. When considering conceptual counterfactuals on images, the edits requested should correspond to salient concepts present in the input data. At the same time, conceptual distances are defined by knowledge graphs, ensuring the optimality of conceptual edits. In this work, we extend previous endeavors on graph edits as counterfactual explanations by conducting a comparative study which encompasses both supervised and unsupervised Graph Neural Network (GNN) approaches. To this end, we pose the following significant research question: should we represent input data as graphs, which is the optimal GNN approach in terms of performance and time efficiency to generate minimal and meaningful counterfactual explanations for black-box image classifiers?

View on arXiv PDF

Similar