LGAIMLMay 15, 2019

IPC: A Benchmark Data Set for Learning with Graph-Structured Data

arXiv:1905.06393v15 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This provides a new, scalable benchmark for researchers in graph machine learning, though it is incremental as it adds to existing benchmark collections.

The authors introduced the IPC benchmark data set, derived from International Planning Competitions, to evaluate graph-based machine learning methods for classification and regression tasks, featuring graphs with large, skewed sizes and directed/acyclic structures that challenge existing models.

Benchmark data sets are an indispensable ingredient of the evaluation of graph-based machine learning methods. We release a new data set, compiled from International Planning Competitions (IPC), for benchmarking graph classification, regression, and related tasks. Apart from the graph construction (based on AI planning problems) that is interesting in its own right, the data set possesses distinctly different characteristics from popularly used benchmarks. The data set, named IPC, consists of two self-contained versions, grounded and lifted, both including graphs of large and skewedly distributed sizes, posing substantial challenges for the computation of graph models such as graph kernels and graph neural networks. The graphs in this data set are directed and the lifted version is acyclic, offering the opportunity of benchmarking specialized models for directed (acyclic) structures. Moreover, the graph generator and the labeling are computer programmed; thus, the data set may be extended easily if a larger scale is desired. The data set is accessible from \url{https://github.com/IBM/IPC-graph-data}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes