LG, AI, SI | Mar 12, 2024

Graph Data Condensation via Self-expressive Graph Structure Reconstruction

arXiv:2403.07294v2 | 26 citations | h-index: 7 | Has Code | KDD
Originality: Incremental advance
AI Analysis

This work addresses efficiency challenges in graph machine learning for researchers and practitioners, though it is an incremental advance that builds on existing graph data condensation methods.

The paper tackles the problem of reducing storage and time costs for training graph neural networks on large-scale graphs by condensing them into smaller synthetic graphs, achieving improved performance across diverse GNN models and datasets.

With the increasing demands of training graph neural networks (GNNs) on large-scale graphs, graph data condensation has emerged as a critical technique to relieve the storage and time costs during the training phase. It aims to condense the original large-scale graph to a much smaller synthetic graph while preserving the essential information necessary for efficiently training a downstream GNN. However, existing methods either optimize node features exclusively or learn node features and the graph structure generator independently. They cannot explicitly leverage the information in the original graph structure, and they fail to construct an interpretable graph structure for the synthetic dataset. To address these issues, we introduce a novel framework named Graph Data Condensation via Self-expressive Graph Structure Reconstruction (GCSR). Our method stands out by (1) explicitly incorporating the original graph structure into the condensing process and (2) capturing the nuanced interdependencies between the condensed nodes by reconstructing an interpretable self-expressive graph structure. Extensive experiments and comprehensive analysis validate the efficacy of the proposed method across diverse GNN models and datasets. Our code is available at https://github.com/zclzcl0223/GCSR.
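To make the term "self-expressive graph structure" more concrete, the following is a minimal sketch of the generic self-expressive idea: each synthetic node's feature vector is approximated as a linear combination of the other nodes' features, and the combination coefficients are read as edge weights. It assumes a plain ridge-regularized objective, min_Z ||X - ZX||_F^2 + alpha ||Z||_F^2, which has the closed-form solution Z = (XX^T + alpha I)^{-1} XX^T. This objective, the function name `self_expressive_adjacency`, the parameter `alpha`, and the symmetrization step are all illustrative assumptions; the abstract does not state the exact objective GCSR uses, and the paper's formulation additionally incorporates the original graph structure, which this sketch omits.

```python
import numpy as np


def self_expressive_adjacency(X, alpha=1.0):
    """Hypothetical helper: ridge-regularized self-expressive coefficients.

    Minimizes ||X - Z X||_F^2 + alpha * ||Z||_F^2 over Z, then converts Z
    into a symmetric, non-negative adjacency matrix without self-loops.
    """
    n = X.shape[0]
    gram = X @ X.T  # (n, n) Gram matrix of node features
    # Stationarity condition: Z (gram + alpha I) = gram.
    # gram is symmetric, so solving (gram + alpha I) Z = gram gives the same Z.
    Z = np.linalg.solve(gram + alpha * np.eye(n), gram)
    # Assumed post-processing: symmetrize magnitudes and drop the diagonal.
    A = 0.5 * (np.abs(Z) + np.abs(Z.T))
    np.fill_diagonal(A, 0.0)
    return Z, A


# Toy synthetic graph: 6 condensed nodes with 4-dimensional features.
rng = np.random.default_rng(0)
X_syn = rng.normal(size=(6, 4))
Z, A = self_expressive_adjacency(X_syn, alpha=0.5)
print(A.round(3))
```

In this sketch, a larger `alpha` shrinks the coefficients toward zero, while a small `alpha` lets Z drift toward the identity (each node explaining itself). That drift is why the diagonal is discarded when building the adjacency.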

Code Implementations: 1 repo
