LGJan 31, 2023

Transformers Meet Directed Graphs

Simon Geisler, Yujia Li, Daniel Mankowitz, Ali Taylan Cemgil, Stephan Günnemann, Cosmin Paduraru

DeepMind

arXiv:2302.00049v324.352 citationsh-index: 34Has Code

Originality Incremental advance

AI Analysis

This work addresses a gap in transformer models for directed graphs, which is incremental but important for applications like source code understanding and correctness testing.

The paper tackles the problem of applying transformers to directed graphs, which are common in domains like source code and logic circuits but underexplored, by proposing two direction-aware positional encodings and showing that they improve performance, such as achieving a 14.7% relative improvement over prior state-of-the-art on the Open Graph Benchmark Code2.

Transformers were originally proposed as a sequence-to-sequence model for text but have become vital for a wide range of modalities, including images, audio, video, and undirected graphs. However, transformers for directed graphs are a surprisingly underexplored topic, despite their applicability to ubiquitous domains, including source code and logic circuits. In this work, we propose two direction- and structure-aware positional encodings for directed graphs: (1) the eigenvectors of the Magnetic Laplacian - a direction-aware generalization of the combinatorial Laplacian; (2) directional random walk encodings. Empirically, we show that the extra directionality information is useful in various downstream tasks, including correctness testing of sorting networks and source code understanding. Together with a data-flow-centric graph construction, our model outperforms the prior state of the art on the Open Graph Benchmark Code2 relatively by 14.7%.

View on arXiv PDF Code

Similar