CLLGNov 27, 2025

ResearchArcade: Graph Interface for Academic Tasks

arXiv:2511.22036v1
Originality Incremental advance
AI Analysis

This addresses the need for a unified data interface to support machine learning models for academic tasks, potentially accelerating knowledge discovery for researchers, though it appears incremental as it builds on existing data sources and methods.

The paper tackles the problem of fragmented academic data sources by introducing ResearchArcade, a graph-based interface that unifies data from sources like ArXiv and OpenReview, and experiments show it improves performance across six academic tasks by leveraging cross-source and multi-modal information.

Academic research generates diverse data sources, and as researchers increasingly use machine learning to assist research tasks, a crucial question arises: Can we build a unified data interface to support the development of machine learning models for various academic tasks? Models trained on such a unified interface can better support human researchers throughout the research process, eventually accelerating knowledge discovery. In this work, we introduce ResearchArcade, a graph-based interface that connects multiple academic data sources, unifies task definitions, and supports a wide range of base models to address key academic challenges. ResearchArcade utilizes a coherent multi-table format with graph structures to organize data from different sources, including academic corpora from ArXiv and peer reviews from OpenReview, while capturing information with multiple modalities, such as text, figures, and tables. ResearchArcade also preserves temporal evolution at both the manuscript and community levels, supporting the study of paper revisions as well as broader research trends over time. Additionally, ResearchArcade unifies diverse academic task definitions and supports various models with distinct input requirements. Our experiments across six academic tasks demonstrate that combining cross-source and multi-modal information enables a broader range of tasks, while incorporating graph structures consistently improves performance over baseline methods. This highlights the effectiveness of ResearchArcade and its potential to advance research progress.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes