LGNov 23, 2025

FOS: A Large-Scale Temporal Graph Benchmark for Scientific Interdisciplinary Link Prediction

arXiv:2511.18631v11 citations
Originality Incremental advance
AI Analysis

This provides a reproducible benchmark for researchers in AI and science studies to advance prediction of scientific frontiers, though it is incremental as it builds on existing temporal graph methods.

The authors tackled the challenge of forecasting interdisciplinary scientific breakthroughs by introducing FOS, a large-scale temporal graph benchmark for predicting new field-pair linkages, showing that embedding textual descriptions boosts accuracy and that top-ranked predictions align with future academic publications.

Interdisciplinary scientific breakthroughs mostly emerge unexpectedly, and forecasting the formation of novel research fields remains a major challenge. We introduce FOS (Future Of Science), a comprehensive time-aware graph-based benchmark that reconstructs annual co-occurrence graphs of 65,027 research sub-fields (spanning 19 general domains) over the period 1827-2024. In these graphs, edges denote the co-occurrence of two fields in a single publication and are timestamped with the corresponding publication year. Nodes are enriched with semantic embeddings, and edges are characterized by temporal and topological descriptors. We formulate the prediction of new field-pair linkages as a temporal link-prediction task, emphasizing the "first-time" connections that signify pioneering interdisciplinary directions. Through extensive experiments, we evaluate a suite of state-of-the-art temporal graph architectures under multiple negative-sampling regimes and show that (i) embedding long-form textual descriptions of fields significantly boosts prediction accuracy, and (ii) distinct model classes excel under different evaluation settings. Case analyses show that top-ranked link predictions on FOS align with field pairings that emerge in subsequent years of academic publications. We publicly release FOS, along with its temporal data splits and evaluation code, to establish a reproducible benchmark for advancing research in predicting scientific frontiers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes