LGJan 24, 2025

GraphSOS: Graph Sampling and Order Selection to Help LLMs Understand Graphs Better

arXiv:2501.14427v34 citationsh-index: 9
Originality Incremental advance
AI Analysis

This addresses a specific bottleneck in applying LLMs to graph data, offering an incremental improvement for researchers and practitioners in graph machine learning.

The paper tackles the problem of LLMs' sensitivity to node/edge order and ineffective neighbor sampling in graph tasks, introducing GraphSOS to improve performance and generalization, with experiments showing enhanced results on node classification and graph QA datasets.

The success of Large Language Models (LLMs) in various domains has led researchers to apply them to graph-related problems by converting graph data into natural language text. However, unlike graph data, natural language inherently has sequential order. We observe a counter-intuitive fact that when the order of nodes or edges in the natural language description of a graph is shuffled, despite describing the same graph, model performance fluctuates between high performance and random guessing. Additionally, due to LLMs' limited input context length, current methods typically randomly sample neighbors of target nodes as representatives of their neighborhood, which may not always be effective for accurate reasoning. To address these gaps, we introduce GraphSOS (Graph Sampling and Order Selection). This novel model framework features an Order Selector Module to ensure proper serialization order of the graph and a Subgraph Sampling Module to sample subgraphs with better structure for better reasoning. Furthermore, we propose Graph CoT obtained through distillation, and enhance LLM's reasoning and zero-shot learning capabilities for graph tasks through instruction tuning. Experiments on multiple datasets for node classification and graph question-answering demonstrate that GraphSOS improves LLMs' performance and generalization ability on graph tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes