CLMay 30, 2025

LGAR: Zero-Shot LLM-Guided Neural Ranking for Abstract Screening in Systematic Literature Reviews

Christian Jaumann, Andreas Wiedholz, Annemarie Friedrich

arXiv:2505.24757v26.72 citationsh-index: 2Has CodeACL

Originality Highly original

AI Analysis

This work addresses the challenge of efficiently screening abstracts for systematic literature reviews, particularly in the medical domain, by leveraging LLMs to improve ranking accuracy.

The paper tackled the problem of abstract screening in systematic literature reviews by proposing LGAR, a zero-shot LLM-guided neural ranking method, which outperformed existing QA-based methods by 5-10 percentage points in mean average precision.

The scientific literature is growing rapidly, making it hard to keep track of the state-of-the-art. Systematic literature reviews (SLRs) aim to identify and evaluate all relevant papers on a topic. After retrieving a set of candidate papers, the abstract screening phase determines initial relevance. To date, abstract screening methods using large language models (LLMs) focus on binary classification settings; existing question answering (QA) based ranking approaches suffer from error propagation. LLMs offer a unique opportunity to evaluate the SLR's inclusion and exclusion criteria, yet, existing benchmarks do not provide them exhaustively. We manually extract these criteria as well as research questions for 57 SLRs, mostly in the medical domain, enabling principled comparisons between approaches. Moreover, we propose LGAR, a zero-shot LLM Guided Abstract Ranker composed of an LLM based graded relevance scorer and a dense re-ranker. Our extensive experiments show that LGAR outperforms existing QA-based methods by 5-10 pp. in mean average precision. Our code and data is publicly available.

View on arXiv PDF Code

Similar