LGAINEFeb 8, 2023

Attending to Graph Transformers

DeepMind
arXiv:2302.04181v3128 citationsh-index: 25Has Code
Originality Synthesis-oriented
AI Analysis

This work organizes and evaluates graph transformers, which are emerging as alternatives to graph neural networks, addressing issues like over-squashing for researchers in graph machine learning.

The paper provides a taxonomy and overview of graph transformer architectures, analyzing their theoretical properties and empirical performance on tasks like recovering graph properties and handling heterophilic graphs.

Recently, transformer architectures for graphs emerged as an alternative to established techniques for machine learning with graphs, such as (message-passing) graph neural networks. So far, they have shown promising empirical results, e.g., on molecular prediction datasets, often attributed to their ability to circumvent graph neural networks' shortcomings, such as over-smoothing and over-squashing. Here, we derive a taxonomy of graph transformer architectures, bringing some order to this emerging field. We overview their theoretical properties, survey structural and positional encodings, and discuss extensions for important graph classes, e.g., 3D molecular graphs. Empirically, we probe how well graph transformers can recover various graph properties, how well they can deal with heterophilic graphs, and to what extent they prevent over-squashing. Further, we outline open challenges and research direction to stimulate future work. Our code is available at https://github.com/luis-mueller/probing-graph-transformers.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes