LGFeb 2, 2024

Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness

arXiv:2402.01242v115 citationsh-index: 14ICML
Originality Highly original
AI Analysis

This addresses efficiency issues for practitioners using GNNs on large graphs, offering a novel approach that combines topological and semantic awareness to prevent performance collapse at high sparsity.

The paper tackles the computational challenges of Graph Neural Networks (GNNs) on large-scale graphs by proposing Graph Sparse Training (GST), a method that dynamically removes non-essential edges to reduce overhead while preserving performance, achieving up to 3.42× speedup in inference and higher sparsity levels than state-of-the-art methods.

Graph Neural Networks (GNNs) excel in various graph learning tasks but face computational challenges when applied to large-scale graphs. A promising solution is to remove non-essential edges to reduce the computational overheads in GNN. Previous literature generally falls into two categories: topology-guided and semantic-guided. The former maintains certain graph topological properties yet often underperforms on GNNs due to low integration with neural network training. The latter performs well at lower sparsity on GNNs but faces performance collapse at higher sparsity levels. With this in mind, we take the first step to propose a new research line and concept termed Graph Sparse Training (GST), which dynamically manipulates sparsity at the data level. Specifically, GST initially constructs a topology & semantic anchor at a low training cost, followed by performing dynamic sparse training to align the sparse graph with the anchor. We introduce the Equilibria Sparsification Principle to guide this process, effectively balancing the preservation of both topological and semantic information. Ultimately, GST produces a sparse graph with maximum topological integrity and no performance degradation. Extensive experiments on 6 datasets and 5 backbones showcase that GST (I) identifies subgraphs at higher graph sparsity levels (1.67%~15.85% $\uparrow$) than state-of-the-art sparsification methods, (II) preserves more key spectral properties, (III) achieves 1.27-3.42$\times$ speedup in GNN inference and (IV) successfully helps graph adversarial defense and graph lottery tickets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes