LGJun 15, 2022

Taxonomy of Benchmarks in Graph Representation Learning

Renming Liu, Semih Cantürk, Frederik Wenkel, Sarah McGuire, Xinyi Wang, Anna Little, Leslie O'Bray, Michael Perlmutter, Bastian Rieck, Matthew Hirn, Guy Wolf, Ladislav Rampášek

MILA

arXiv:2206.07729v411.816 citationsh-index: 29Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the need for better benchmark selection and evaluation in graph representation learning, which is incremental as it builds on existing benchmarking practices.

The authors tackled the problem of unclear benchmarking in graph neural networks (GNNs) by developing a taxonomy based on sensitivity profiles to graph perturbations, revealing which data characteristics GNNs leverage.

Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry. While extensive research has been done on developing GNN models with superior performance according to a collection of graph representation learning benchmarks, it is currently not well understood what aspects of a given model are probed by them. For example, to what extent do they test the ability of a model to leverage graph structure vs. node features? Here, we develop a principled approach to taxonomize benchmarking datasets according to a $\textit{sensitivity profile}$ that is based on how much GNN performance changes due to a collection of graph perturbations. Our data-driven analysis provides a deeper understanding of which benchmarking data characteristics are leveraged by GNNs. Consequently, our taxonomy can aid in selection and development of adequate graph benchmarks, and better informed evaluation of future GNN methods. Finally, our approach and implementation in $\texttt{GTaxoGym}$ package are extendable to multiple graph prediction task types and future datasets.

View on arXiv PDF Code

Similar