LGJun 7, 2021

Rethinking Graph Transformers with Spectral Attention

arXiv:2106.03893v3770 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of effectively using Transformers for graph data, which is significant for researchers and practitioners in graph machine learning, though it appears incremental as it builds on existing Transformer and GNN methods.

The paper tackled the challenge of applying Transformers to graphs by introducing the Spectral Attention Network (SAN), which uses a learned positional encoding based on the full Laplacian spectrum to improve node positioning and avoid over-squashing, resulting in performance on par or better than state-of-the-art GNNs on standard datasets.

In recent years, the Transformer architecture has proven to be very successful in sequence processing, but its application to other data structures, such as graphs, has remained limited due to the difficulty of properly defining positions. Here, we present the $\textit{Spectral Attention Network}$ (SAN), which uses a learned positional encoding (LPE) that can take advantage of the full Laplacian spectrum to learn the position of each node in a given graph. This LPE is then added to the node features of the graph and passed to a fully-connected Transformer. By leveraging the full spectrum of the Laplacian, our model is theoretically powerful in distinguishing graphs, and can better detect similar sub-structures from their resonance. Further, by fully connecting the graph, the Transformer does not suffer from over-squashing, an information bottleneck of most GNNs, and enables better modeling of physical phenomenons such as heat transfer and electric interaction. When tested empirically on a set of 4 standard datasets, our model performs on par or better than state-of-the-art GNNs, and outperforms any attention-based model by a wide margin, becoming the first fully-connected architecture to perform well on graph benchmarks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes