LG AI IT NE SI | Jun 7, 2024

Faithful and Accurate Self-Attention Attribution for Message Passing Neural Networks via the Computation Tree Viewpoint

Yong-Min Shin, Siqing Li, Xin Cao, Won-Yong Shin

arXiv:2406.04612v2 | 7.95 citations | Has Code

Originality: Incremental advance

AI Analysis

This work addresses the explainability gap in widely used attention-based graph neural networks, offering a more faithful and accurate interpretation method for researchers and practitioners in machine learning.

The study addresses the problem of deriving accurate attribution scores from self-attention in attention-based message passing neural networks (Att-GNNs) for explainable AI. It proposes GATT, a method based on computation trees whose edge attribution scores significantly improve on naive approaches, as demonstrated empirically on synthetic and real-world datasets.

The self-attention mechanism has been adopted in various popular message passing neural networks (MPNNs), enabling the model to adaptively control the amount of information that flows along the edges of the underlying graph. Such attention-based MPNNs (Att-GNNs) have also been used as a baseline in multiple studies on explainable AI (XAI), since attention has increasingly been seen as a natural model interpretation, a viewpoint already popularized in other domains (e.g., natural language processing and computer vision). However, existing studies often use naive calculations to derive attribution scores from attention, undermining the potential of attention as an interpretation for Att-GNNs. In our study, we aim to bridge the gap between the widespread usage of Att-GNNs and their potential explainability via attention. To this end, we propose GATT, an edge attribution calculation method for self-attention MPNNs based on the computation tree, a rooted tree that reflects the computation process of the underlying model. Despite its simplicity, we empirically demonstrate the effectiveness of GATT in three aspects of model explanation: faithfulness, explanation accuracy, and case studies, using both synthetic and real-world benchmark datasets. In all cases, the results demonstrate that GATT greatly improves edge attribution scores, especially compared to the previous naive approach. Our code is available at https://github.com/jordan7186/GAtt.
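
The abstract does not give GATT's formula, so the following is only a minimal sketch of the computation tree viewpoint under stated assumptions, not the authors' implementation. It assumes that each occurrence of an edge in the tree rooted at the target node is weighted by the product of attention weights on the path from that occurrence up to the root, and that "naive" means averaging an edge's attention over layers. The function names `tree_edge_attribution` and `naive_edge_attribution`, and the toy graph, are hypothetical.

```python
from collections import defaultdict

def tree_edge_attribution(attention, target, num_layers):
    """Score edges by walking the computation tree rooted at `target`.

    attention[l][(src, dst)] is the attention weight of edge src->dst
    at layer l (l = 0 is the first layer, num_layers - 1 the last).
    ASSUMPTION: an edge occurrence contributes its own attention weight
    times the product of attention weights on the path to the root.
    """
    in_neighbors = defaultdict(list)
    for src, dst in attention[0]:
        in_neighbors[dst].append(src)

    scores = defaultdict(float)

    def visit(node, layer, path_weight):
        if layer < 0:  # reached the leaves of the computation tree
            return
        for src in in_neighbors[node]:
            weight = path_weight * attention[layer][(src, node)]
            scores[(src, node)] += weight
            visit(src, layer - 1, weight)

    # The root consumes the last layer's attention first.
    visit(target, num_layers - 1, 1.0)
    return dict(scores)

def naive_edge_attribution(attention):
    """ASSUMED naive baseline: average each edge's attention over layers."""
    return {e: sum(layer[e] for layer in attention) / len(attention)
            for e in attention[0]}

# Toy directed path graph 0 -> 1 -> 2 with self-loops; the same
# (hypothetical) attention weights are reused for both layers.
layer_attention = {(0, 0): 1.0, (0, 1): 0.5, (1, 1): 0.5,
                   (1, 2): 0.75, (2, 2): 0.25}
attention = [layer_attention, layer_attention]

tree_scores = tree_edge_attribution(attention, target=2, num_layers=2)
naive_scores = naive_edge_attribution(attention)
print(tree_scores)
print(naive_scores)
```

In this toy case the self-loop (0, 0) never appears in the two-layer computation tree of node 2, so the tree-based sketch gives it no score, whereas layer averaging gives it the highest score of any edge. This is the kind of mismatch that target-agnostic averaging can produce.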
