LGJun 15, 2022

Taxonomy of Benchmarks in Graph Representation Learning

MILA
arXiv:2206.07729v416 citationsh-index: 29
Originality Synthesis-oriented
AI Analysis

This work addresses the need for better benchmark selection and evaluation in graph representation learning, which is incremental as it builds on existing benchmarking practices.

The authors tackled the problem of unclear benchmarking in graph neural networks (GNNs) by developing a taxonomy based on sensitivity profiles to graph perturbations, revealing which data characteristics GNNs leverage.

Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry. While extensive research has been done on developing GNN models with superior performance according to a collection of graph representation learning benchmarks, it is currently not well understood what aspects of a given model are probed by them. For example, to what extent do they test the ability of a model to leverage graph structure vs. node features? Here, we develop a principled approach to taxonomize benchmarking datasets according to a $\textit{sensitivity profile}$ that is based on how much GNN performance changes due to a collection of graph perturbations. Our data-driven analysis provides a deeper understanding of which benchmarking data characteristics are leveraged by GNNs. Consequently, our taxonomy can aid in selection and development of adequate graph benchmarks, and better informed evaluation of future GNN methods. Finally, our approach and implementation in $\texttt{GTaxoGym}$ package are extendable to multiple graph prediction task types and future datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes