ML LGJan 22, 2024

VC dimension of Graph Neural Networks with Pfaffian activation functions

Giuseppe Alessio D'Inverno, Monica Bianchini, Franco Scarselli

arXiv:2401.12362v210.75 citationsh-index: 28Has CodeNeural Networks

Originality Incremental advance

AI Analysis

This work addresses theoretical generalization bounds for GNNs with common activation functions, which is incremental as it extends prior analysis from piecewise polynomial functions.

The authors tackled the problem of bounding the VC dimension of Graph Neural Networks (GNNs) for activation functions like sigmoid and hyperbolic tangent, using Pfaffian function theory, and provided bounds based on architecture parameters and graph properties, supported by preliminary experiments.

Graph Neural Networks (GNNs) have emerged in recent years as a powerful tool to learn tasks across a wide range of graph domains in a data-driven fashion; based on a message passing mechanism, GNNs have gained increasing popularity due to their intuitive formulation, closely linked with the Weisfeiler-Lehman (WL) test for graph isomorphism, to which they have proven equivalent. From a theoretical point of view, GNNs have been shown to be universal approximators, and their generalization capability (namely, bounds on the Vapnik Chervonekis (VC) dimension) has recently been investigated for GNNs with piecewise polynomial activation functions. The aim of our work is to extend this analysis on the VC dimension of GNNs to other commonly used activation functions, such as sigmoid and hyperbolic tangent, using the framework of Pfaffian function theory. Bounds are provided with respect to architecture parameters (depth, number of neurons, input size) as well as with respect to the number of colors resulting from the 1-WL test applied on the graph domain. The theoretical analysis is supported by a preliminary experimental study.

View on arXiv PDF Code

Similar