LGDCJun 10, 2024

GraphStorm: all-in-one graph machine learning framework for industry applications

arXiv:2406.06022v19 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This framework addresses the problem of applying graph machine learning to large-scale industry applications for practitioners and researchers, though it appears incremental as it builds on existing GML techniques.

The authors tackled the challenge of making graph machine learning easy to use and scalable for industry applications with massive datasets by developing GraphStorm, an all-in-one framework that provides end-to-end solutions for graph construction, training, and inference, which has been deployed in over a dozen billion-scale industry applications.

Graph machine learning (GML) is effective in many business applications. However, making GML easy to use and applicable to industry applications with massive datasets remain challenging. We developed GraphStorm, which provides an end-to-end solution for scalable graph construction, graph model training and inference. GraphStorm has the following desirable properties: (a) Easy to use: it can perform graph construction and model training and inference with just a single command; (b) Expert-friendly: GraphStorm contains many advanced GML modeling techniques to handle complex graph data and improve model performance; (c) Scalable: every component in GraphStorm can operate on graphs with billions of nodes and can scale model training and inference to different hardware without changing any code. GraphStorm has been used and deployed for over a dozen billion-scale industry applications after its release in May 2023. It is open-sourced in Github: https://github.com/awslabs/graphstorm.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes