LGAIJan 2, 2025

Graph Generative Pre-trained Transformer

arXiv:2501.01073v218 citationsh-index: 7Has CodeICML
Originality Incremental advance
AI Analysis

This work addresses graph generation for domains like molecular design and social network analysis, presenting an incremental improvement with a novel representation and model.

The paper tackles graph generation by proposing a sequence-based representation and the Graph Generative Pre-trained Transformer (G2PT), achieving superior generative performance on generic graph and molecule datasets.

Graph generation is a critical task in numerous domains, including molecular design and social network analysis, due to its ability to model complex relationships and structured data. While most modern graph generative models utilize adjacency matrix representations, this work revisits an alternative approach that represents graphs as sequences of node set and edge set. We advocate for this approach due to its efficient encoding of graphs and propose a novel representation. Based on this representation, we introduce the Graph Generative Pre-trained Transformer (G2PT), an auto-regressive model that learns graph structures via next-token prediction. To further exploit G2PT's capabilities as a general-purpose foundation model, we explore fine-tuning strategies for two downstream applications: goal-oriented generation and graph property prediction. We conduct extensive experiments across multiple datasets. Results indicate that G2PT achieves superior generative performance on both generic graph and molecule datasets. Furthermore, G2PT exhibits strong adaptability and versatility in downstream tasks from molecular design to property prediction. Code available at https://github.com/tufts-ml/G2PT,

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes