LGOct 12, 2021

Scalable Consistency Training for Graph Neural Networks via Self-Ensemble Self-Distillation

Cole Hawkins, Vassilis N. Ioannidis, Soji Adeshina, George Karypis

arXiv:2110.06290v13.11 citations

Originality Incremental advance

AI Analysis

This work addresses scalability and accuracy issues for GNNs in network science tasks, representing an incremental advancement by adapting consistency training from other domains to graphs.

The paper tackles the problem of improving graph neural networks (GNNs) on large-scale graphs by introducing a consistency training method that leverages randomness in neighbor subsampling, resulting in accuracy gains, especially with low label rates.

Consistency training is a popular method to improve deep learning models in computer vision and natural language processing. Graph neural networks (GNNs) have achieved remarkable performance in a variety of network science learning tasks, but to date no work has studied the effect of consistency training on large-scale graph problems. GNNs scale to large graphs by minibatch training and subsample node neighbors to deal with high degree nodes. We utilize the randomness inherent in the subsampling of neighbors and introduce a novel consistency training method to improve accuracy. For a target node we generate different neighborhood expansions, and distill the knowledge of the average of the predictions to the GNN. Our method approximates the expected prediction of the possible neighborhood samples and practically only requires a few samples. We demonstrate that our training method outperforms standard GNN training in several different settings, and yields the largest gains when label rates are low.

View on arXiv PDF

Similar