CLApr 14, 2025

Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol

arXiv:2504.10284v37 citationsh-index: 15Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of summarizing and comparing scientific papers for researchers, but it is incremental as it builds on prior work to improve evaluation and methods.

The paper tackles the problem of generating literature review tables from scientific papers to meet user needs, introducing the ARXIV2TABLE benchmark and showing that current LLMs struggle with this task, highlighting its difficulty.

Literature review tables are essential for summarizing and comparing collections of scientific papers. We explore the task of generating tables that best fulfill a user's informational needs given a collection of scientific papers. Building on recent work (Newman et al., 2024), we extend prior approaches to address real-world complexities through a combination of LLM-based methods and human annotations. Our contributions focus on three key challenges encountered in real-world use: (i) User prompts are often under-specified; (ii) Retrieved candidate papers frequently contain irrelevant content; and (iii) Task evaluation should move beyond shallow text similarity techniques and instead assess the utility of inferred tables for information-seeking tasks (e.g., comparing papers). To support reproducible evaluation, we introduce ARXIV2TABLE, a more realistic and challenging benchmark for this task, along with a novel approach to improve literature review table generation in real-world scenarios. Our extensive experiments on this benchmark show that both open-weight and proprietary LLMs struggle with the task, highlighting its difficulty and the need for further advancements. Our dataset and code are available at https://github.com/JHU-CLSP/arXiv2Table.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes