LGMar 16, 2025

Diffusion on Graph: Augmentation of Graph Structure for Node Classification

arXiv:2503.12563v14 citationsh-index: 9Has CodeTrans. Mach. Learn. Res.
Originality Incremental advance
AI Analysis

This addresses a gap in graph diffusion models for node-level tasks, offering a method to enhance GNN training through data augmentation, though it appears incremental as it builds on existing graph diffusion and regularization techniques.

The paper tackles the problem of generating synthetic graph structures for node-level learning tasks, proposing Diffusion on Graph (DoG) with a low-rank regularization method, which improves performance in semi-supervised node classification and graph contrastive learning across various datasets.

Graph diffusion models have recently been proposed to synthesize entire graphs, such as molecule graphs. Although existing methods have shown great performance in generating entire graphs for graph-level learning tasks, no graph diffusion models have been developed to generate synthetic graph structures, that is, synthetic nodes and associated edges within a given graph, for node-level learning tasks. Inspired by the research in the computer vision literature using synthetic data for enhanced performance, we propose Diffusion on Graph (DoG), which generates synthetic graph structures to boost the performance of GNNs. The synthetic graph structures generated by DoG are combined with the original graph to form an augmented graph for the training of node-level learning tasks, such as node classification and graph contrastive learning (GCL). To improve the efficiency of the generation process, a Bi-Level Neighbor Map Decoder (BLND) is introduced in DoG. To mitigate the adverse effect of the noise introduced by the synthetic graph structures, a low-rank regularization method is proposed for the training of graph neural networks (GNNs) on the augmented graphs. Extensive experiments on various graph datasets for semi-supervised node classification and graph contrastive learning have been conducted to demonstrate the effectiveness of DoG with low-rank regularization. The code of DoG is available at https://github.com/Statistical-Deep-Learning/DoG.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes