LGDec 17, 2021

Community-based Layerwise Distributed Training of Graph Convolutional Networks

arXiv:2112.09335v1
Originality Incremental advance
AI Analysis

This addresses efficiency issues for researchers and practitioners working with large graph-based applications, though it appears incremental as it builds on existing ADMM methods.

The paper tackles the computational and memory challenges of training large-scale Graph Convolutional Networks (GCNs) by proposing a parallel and distributed training algorithm based on ADMM, achieving more than triple speedup while matching state-of-the-art performance.

The Graph Convolutional Network (GCN) has been successfully applied to many graph-based applications. Training a large-scale GCN model, however, is still challenging: Due to the node dependency and layer dependency of the GCN architecture, a huge amount of computational time and memory is required in the training process. In this paper, we propose a parallel and distributed GCN training algorithm based on the Alternating Direction Method of Multipliers (ADMM) to tackle the two challenges simultaneously. We first split GCN layers into independent blocks to achieve layer parallelism. Furthermore, we reduce node dependency by dividing the graph into several dense communities such that each of them can be trained with an agent in parallel. Finally, we provide solutions for all subproblems in the community-based ADMM algorithm. Preliminary results demonstrate that our proposed community-based ADMM training algorithm can lead to more than triple speedup while achieving the best performance compared with state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes