LGMay 23, 2025

Graph Inverse Style Transfer for Counterfactual Explainability

Bardh Prenkaj, Efstratios Zaradoukas, Gjergji Kasneci

arXiv:2505.17542v24.11 citationsh-index: 36Has CodeICML

Originality Highly original

AI Analysis

It addresses the challenge of explainability for graph models, offering a novel approach that advances the field beyond traditional methods.

The paper tackles the problem of generating counterfactual explanations for graph data by introducing Graph Inverse Style Transfer (GIST), a backtracking framework that improves validity by +7.6% and faithfulness by +45.5% across 8 benchmarks.

Counterfactual explainability seeks to uncover model decisions by identifying minimal changes to the input that alter the predicted outcome. This task becomes particularly challenging for graph data due to preserving structural integrity and semantic meaning. Unlike prior approaches that rely on forward perturbation mechanisms, we introduce Graph Inverse Style Transfer (GIST), the first framework to re-imagine graph counterfactual generation as a backtracking process, leveraging spectral style transfer. By aligning the global structure with the original input spectrum and preserving local content faithfulness, GIST produces valid counterfactuals as interpolations between the input style and counterfactual content. Tested on 8 binary and multi-class graph classification benchmarks, GIST achieves a remarkable +7.6% improvement in the validity of produced counterfactuals and significant gains (+45.5%) in faithfully explaining the true class distribution. Additionally, GIST's backtracking mechanism effectively mitigates overshooting the underlying predictor's decision boundary, minimizing the spectral differences between the input and the counterfactuals. These results challenge traditional forward perturbation methods, offering a novel perspective that advances graph explainability.

View on arXiv PDF Code

Similar