CLDLFeb 20, 2025

Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs

arXiv:2502.14561v36 citationsh-index: 18TPDL
Originality Incremental advance
AI Analysis

This addresses the problem of citation intent prediction for researchers and practitioners in academic domains, offering an incremental improvement over existing methods.

This work tackled the problem of predicting citation intent using open Large Language Models (LLMs) through in-context learning and fine-tuning, achieving relative F1-score improvements of 8% on the SciCite dataset and 4.3% on the ACL-ARC dataset compared to baselines.

This work investigates the ability of open Large Language Models (LLMs) to predict citation intent through in-context learning and fine-tuning. Unlike traditional approaches relying on domain-specific pre-trained models like SciBERT, we demonstrate that general-purpose LLMs can be adapted to this task with minimal task-specific data. We evaluate twelve model variations across five prominent open LLM families using zero-, one-, few-, and many-shot prompting. Our experimental study identifies the top-performing model and prompting parameters through extensive in-context learning experiments. We then demonstrate the significant impact of task-specific adaptation by fine-tuning this model, achieving a relative F1-score improvement of 8% on the SciCite dataset and 4.3% on the ACL-ARC dataset compared to the instruction-tuned baseline. These findings provide valuable insights for model selection and prompt engineering. Additionally, we make our end-to-end evaluation framework and models openly available for future use.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes