LGJul 14, 2025

T-GRAB: A Synthetic Diagnostic Benchmark for Learning on Temporal Graphs

arXiv:2507.10183v2h-index: 7Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses a critical gap for researchers in dynamic graph learning by providing a diagnostic tool to uncover limitations in current models, though it is incremental as it focuses on benchmarking rather than proposing new architectures.

The authors tackled the problem of evaluating whether Temporal Graph Neural Networks (TGNNs) effectively capture core temporal patterns like periodicity and cause-and-effect by introducing T-GRAB, a synthetic benchmark with controlled tasks, and found that 11 evaluated methods showed fundamental shortcomings in generalizing these patterns.

Dynamic graph learning methods have recently emerged as powerful tools for modelling relational data evolving through time. However, despite extensive benchmarking efforts, it remains unclear whether current Temporal Graph Neural Networks (TGNNs) effectively capture core temporal patterns such as periodicity, cause-and-effect, and long-range dependencies. In this work, we introduce the Temporal Graph Reasoning Benchmark (T-GRAB), a comprehensive set of synthetic tasks designed to systematically probe the capabilities of TGNNs to reason across time. T-GRAB provides controlled, interpretable tasks that isolate key temporal skills: counting/memorizing periodic repetitions, inferring delayed causal effects, and capturing long-range dependencies over both spatial and temporal dimensions. We evaluate 11 temporal graph learning methods on these tasks, revealing fundamental shortcomings in their ability to generalize temporal patterns. Our findings offer actionable insights into the limitations of current models, highlight challenges hidden by traditional real-world benchmarks, and motivate the development of architectures with stronger temporal reasoning abilities. The code for T-GRAB can be found at: https://github.com/alirezadizaji/T-GRAB.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes