LGDCOct 13, 2021

Scalable Graph Embedding LearningOn A Single GPU

arXiv:2110.06991v21 citations
Originality Incremental advance
AI Analysis

This work addresses scalability issues in graph analytics for real-world applications, though it appears incremental as it builds on existing GPU acceleration methods.

The paper tackles the challenge of scaling graph embedding learning to large-scale networks by introducing a hybrid CPU-GPU framework, enabling training on datasets an order of magnitude larger than a single machine's memory capacity while maintaining performance and accuracy in downstream applications.

Graph embedding techniques have attracted growing interest since they convert the graph data into continuous and low-dimensional space. Effective graph analytic provides users a deeper understanding of what is behind the data and thus can benefit a variety of machine learning tasks. With the current scale of real-world applications, most graph analytic methods suffer high computation and space costs. These methods and systems can process a network with thousands to a few million nodes. However, scaling to large-scale networks remains a challenge. The complexity of training graph embedding system requires the use of existing accelerators such as GPU. In this paper, we introduce a hybrid CPU-GPU framework that addresses the challenges of learning embedding of large-scale graphs. The performance of our method is compared qualitatively and quantitatively with the existing embedding systems on common benchmarks. We also show that our system can scale training to datasets with an order of magnitude greater than a single machine's total memory capacity. The effectiveness of the learned embedding is evaluated within multiple downstream applications. The experimental results indicate the effectiveness of the learned embedding in terms of performance and accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes