A Manifold Perspective on the Statistical Generalization of Graph Neural Networks
This provides a foundational theoretical guarantee for GNN generalization on unseen data over manifolds, offering insights for practical design, though it is incremental in extending existing theory.
The paper tackles the lack of theoretical understanding of Graph Neural Networks' generalization by establishing a manifold perspective in the spectral domain, proving that generalization bounds decrease linearly with graph size in logarithmic scale and increase with spectral continuity constants.
Graph Neural Networks (GNNs) extend convolutional neural networks to operate on graphs. Despite their impressive performances in various graph learning tasks, the theoretical understanding of their generalization capability is still lacking. Previous GNN generalization bounds ignore the underlying graph structures, often leading to bounds that increase with the number of nodes -- a behavior contrary to the one experienced in practice. In this paper, we take a manifold perspective to establish the statistical generalization theory of GNNs on graphs sampled from a manifold in the spectral domain. As demonstrated empirically, we prove that the generalization bounds of GNNs decrease linearly with the size of the graphs in the logarithmic scale, and increase linearly with the spectral continuity constants of the filter functions. Notably, our theory explains both node-level and graph-level tasks. Our result has two implications: i) guaranteeing the generalization of GNNs to unseen data over manifolds; ii) providing insights into the practical design of GNNs, i.e., restrictions on the discriminability of GNNs are necessary to obtain a better generalization performance. We demonstrate our generalization bounds of GNNs using synthetic and multiple real-world datasets.