LG CLAug 21, 2025

CITE: A Comprehensive Benchmark for Heterogeneous Text-Attributed Graphs on Catalytic Materials

Chenghao Zhang, Qingqing Long, Ludi Wang, Wenjuan Cui, Jianjun Yu, Yi Du

arXiv:2508.15392v17.11 citationsh-index: 4

Originality Synthesis-oriented

AI Analysis

This addresses a critical bottleneck for researchers in representation learning on heterogeneous text-attributed graphs, though it is incremental as it focuses on creating a dataset rather than a new method.

The paper tackles the lack of large-scale benchmark datasets for heterogeneous text-attributed graphs by introducing CITE, a comprehensive benchmark for catalytic materials, comprising over 438K nodes and 1.2M edges, and establishes standardized evaluation procedures with benchmarking on node classification tasks.

Text-attributed graphs(TAGs) are pervasive in real-world systems,where each node carries its own textual features. In many cases these graphs are inherently heterogeneous, containing multiple node types and diverse edge types. Despite the ubiquity of such heterogeneous TAGs, there remains a lack of large-scale benchmark datasets. This shortage has become a critical bottleneck, hindering the development and fair comparison of representation learning methods on heterogeneous text-attributed graphs. In this paper, we introduce CITE - Catalytic Information Textual Entities Graph, the first and largest heterogeneous text-attributed citation graph benchmark for catalytic materials. CITE comprises over 438K nodes and 1.2M edges, spanning four relation types. In addition, we establish standardized evaluation procedures and conduct extensive benchmarking on the node classification task, as well as ablation experiments on the heterogeneous and textual properties of CITE. We compare four classes of learning paradigms, including homogeneous graph models, heterogeneous graph models, LLM(Large Language Model)-centric models, and LLM+Graph models. In a nutshell, we provide (i) an overview of the CITE dataset, (ii) standardized evaluation protocols, and (iii) baseline and ablation experiments across diverse modeling paradigms.

View on arXiv PDF

Similar