CL DLFeb 20, 2025

Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs

Paris Koloveas, Serafeim Chatzopoulos, Thanasis Vergoulis, Christos Tryfonopoulos

arXiv:2502.14561v39.66 citationsh-index: 18Has CodeTPDL

Originality Incremental advance

AI Analysis

This addresses the problem of citation intent prediction for researchers and practitioners in academic domains, offering an incremental improvement over existing methods.

This work tackled the problem of predicting citation intent using open Large Language Models (LLMs) through in-context learning and fine-tuning, achieving relative F1-score improvements of 8% on the SciCite dataset and 4.3% on the ACL-ARC dataset compared to baselines.

This work investigates the ability of open Large Language Models (LLMs) to predict citation intent through in-context learning and fine-tuning. Unlike traditional approaches relying on domain-specific pre-trained models like SciBERT, we demonstrate that general-purpose LLMs can be adapted to this task with minimal task-specific data. We evaluate twelve model variations across five prominent open LLM families using zero-, one-, few-, and many-shot prompting. Our experimental study identifies the top-performing model and prompting parameters through extensive in-context learning experiments. We then demonstrate the significant impact of task-specific adaptation by fine-tuning this model, achieving a relative F1-score improvement of 8% on the SciCite dataset and 4.3% on the ACL-ARC dataset compared to the instruction-tuned baseline. These findings provide valuable insights for model selection and prompt engineering. Additionally, we make our end-to-end evaluation framework and models openly available for future use.

View on arXiv PDF Code

Similar