CLMay 14

The Scientific Contribution Graph: Automated Literature-based Technological Roadmapping at Scale

arXiv:2605.1501113.8
Predicted impact top 70% in CL · last 90 daysOriginality Incremental advance
AI Analysis

For researchers in AI/NLP, this provides a new resource and task for automated scientific discovery and impact assessment.

The authors formulate automated technological roadmapping as extracting scientific contributions and linking them to prerequisites, constructing a large-scale AI/NLP-domain resource with 2 million contributions from 230k papers connected by 12.5 million prerequisite edges. They introduce scientific prerequisite prediction, achieving 0.48 MAP with temporally filtered backtesting.

Scientific contributions rarely develop in isolation, but instead build upon prior discoveries. We formulate the task of automated technological roadmapping as extracting scientific contributions from scholarly articles and linking them to their prerequisites. We present the Scientific Contribution Graph, a large-scale AI/NLP-domain resource containing 2 million detailed scientific contributions extracted from 230k open-access papers and connected by 12.5 million prerequisite edges. We further introduce scientific prerequisite prediction, a scientific discovery task in which models predict which existing technologies can enable future discoveries, and show that contemporary models are rapidly improving on this task, reaching 0.48 MAP when evaluated using temporally filtered backtesting. We anticipate technological roadmapping resources such as this will support scientific impact assessment and automated scientific discovery.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes