LGApr 17, 2025

Inference-friendly Graph Compression for Graph Neural Networks

arXiv:2504.13034v39.42 citationsh-index: 2Proc VLDB Endow

Originality Incremental advance

AI Analysis

This work addresses the scalability issue for GNN users in graph analysis, offering practical compression schemes to reduce inference time, though it is incremental as it builds on existing GNN methods.

The paper tackles the high inference cost of Graph Neural Networks (GNNs) on large graphs by proposing inference-friendly graph compression (IFGC), which compresses graphs to preserve inference results with minimal decompression, achieving significant speed-ups and maintaining accuracy in experiments.

Graph Neural Networks (GNNs) have demonstrated promising performance in graph analysis. Nevertheless, the inference process of GNNs remains costly, hindering their applications for large graphs. This paper proposes inference-friendly graph compression (IFGC), a graph compression scheme to accelerate GNNs inference. Given a graph $G$ and a GNN $M$, an IFGC computes a small compressed graph $G_c$, to best preserve the inference results of $M$ over $G$, such that the result can be directly inferred by accessing $G_c$ with no or little decompression cost. (1) We characterize IFGC with a class of inference equivalence relation. The relation captures the node pairs in $G$ that are not distinguishable for GNN inference. (2) We introduce three practical specifications of IFGC for representative GNNs: structural preserving compression (SPGC), which computes $G_c$ that can be directly processed by GNN inference without decompression; ($α$, $r$)-compression, that allows for a configurable trade-off between compression ratio and inference quality, and anchored compression that preserves inference results for specific nodes of interest. For each scheme, we introduce compression and inference algorithms with guarantees of efficiency and quality of the inferred results. We conduct extensive experiments on diverse sets of large-scale graphs, which verifies the effectiveness and efficiency of our graph compression approaches.

View on arXiv PDF

Similar