Boosting-GNN: Boosting Algorithm for Graph Networks on Imbalanced Node Classification
This addresses the issue of imbalanced datasets in GNNs, which is a domain-specific problem for researchers and practitioners in graph machine learning, but it is incremental as it adapts known boosting techniques to GNNs.
The paper tackles the problem of imbalanced node classification in graph neural networks (GNNs) by proposing Boosting-GNN, an ensemble model that uses GNNs as base classifiers with boosting and transfer learning, achieving an average performance improvement of 4.5% over existing methods.
The Graph Neural Network (GNN) has been widely used for graph data representation. However, the existing researches only consider the ideal balanced dataset, and the imbalanced dataset is rarely considered. Traditional methods such as resampling, reweighting, and synthetic samples that deal with imbalanced datasets are no longer applicable in GNN. This paper proposes an ensemble model called Boosting-GNN, which uses GNNs as the base classifiers during boosting. In Boosting-GNN, higher weights are set for the training samples that are not correctly classified by the previous classifier, thus achieving higher classification accuracy and better reliability. Besides, transfer learning is used to reduce computational cost and increase fitting ability. Experimental results indicate that the proposed Boosting-GNN model achieves better performance than GCN, GraphSAGE, GAT, SGC, N-GCN, and most advanced reweighting and resampling methods on synthetic imbalanced datasets, with an average performance improvement of 4.5%