LGAISep 18, 2023

GDM: Dual Mixup for Graph Classification with Limited Supervision

arXiv:2309.10134v11 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses the annotation cost problem for researchers and practitioners using Graph Neural Networks in domains with limited graph data, though it is an incremental improvement over existing mixup-based methods.

The paper tackles the problem of limited labeled graph samples for graph classification by proposing Graph Dual Mixup (GDM), a graph augmentation method that generates new graph instances using mixup on structural and functional information, and it substantially outperforms state-of-the-art methods on benchmark datasets when labeled graphs are scarce.

Graph Neural Networks (GNNs) require a large number of labeled graph samples to obtain good performance on the graph classification task. The performance of GNNs degrades significantly as the number of labeled graph samples decreases. To reduce the annotation cost, it is therefore important to develop graph augmentation methods that can generate new graph instances to increase the size and diversity of the limited set of available labeled graph samples. In this work, we propose a novel mixup-based graph augmentation method, Graph Dual Mixup (GDM), that leverages both functional and structural information of the graph instances to generate new labeled graph samples. GDM employs a graph structural auto-encoder to learn structural embeddings of the graph samples, and then applies mixup to the structural information of the graphs in the learned structural embedding space and generates new graph structures from the mixup structural embeddings. As for the functional information, GDM applies mixup directly to the input node features of the graph samples to generate functional node feature information for new mixup graph instances. Jointly, the generated input node features and graph structures yield new graph samples which can supplement the set of original labeled graphs. Furthermore, we propose two novel Balanced Graph Sampling methods to enhance the balanced difficulty and diversity for the generated graph samples. Experimental results on the benchmark datasets demonstrate that our proposed method substantially outperforms the state-of-the-art graph augmentation methods when the labeled graphs are scarce.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes