Manifold limit for the training of shallow graph convolutional neural networks

Johanna Tengler, Christoph Brune, José A. Iglesias

arXiv:2601.06025v13.21 citationsh-index: 4

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of training stability for graph neural networks in machine learning, offering a theoretical foundation for consistency across varying data resolutions, but it is incremental as it builds on existing manifold and functional-analytic frameworks.

The paper tackles the problem of ensuring consistent training of shallow graph convolutional neural networks across different graph resolutions by proving Γ-convergence of regularized empirical risk minimization functionals and convergence of their global minimizers, providing a formalization of mesh and sample independence.

We study the discrete-to-continuum consistency of the training of shallow graph convolutional neural networks (GCNNs) on proximity graphs of sampled point clouds under a manifold assumption. Graph convolution is defined spectrally via the graph Laplacian, whose low-frequency spectrum approximates that of the Laplace-Beltrami operator of the underlying smooth manifold, and shallow GCNNs of possibly infinite width are linear functionals on the space of measures on the parameter space. From this functional-analytic perspective, graph signals are seen as spatial discretizations of functions on the manifold, which leads to a natural notion of training data consistent across graph resolutions. To enable convergence results, the continuum parameter space is chosen as a weakly compact product of unit balls, with Sobolev regularity imposed on the output weight and bias, but not on the convolutional parameter. The corresponding discrete parameter spaces inherit the corresponding spectral decay, and are additionally restricted by a frequency cutoff adapted to the informative spectral window of the graph Laplacians. Under these assumptions, we prove $Γ$-convergence of regularized empirical risk minimization functionals and corresponding convergence of their global minimizers, in the sense of weak convergence of the parameter measures and uniform convergence of the functions over compact sets. This provides a formalization of mesh and sample independence for the training of such networks.

View on arXiv PDF

Similar