CLFeb 19, 2025

UM_FHS at TREC 2024 PLABA: Exploration of Fine-tuning and AI agent approach for plain language adaptations of biomedical text

Primoz Kocbek, Leon Kopitar, Zhihong Zhang, Emirhan Aydin, Maxim Topaz, Gregor Stiglic

arXiv:2502.14144v12.7h-index: 11TREC

Originality Synthesis-oriented

AI Analysis

This work addresses making biomedical text accessible for young students, but it is incremental as it applies existing AI methods to a new domain-specific task.

The paper tackled simplifying biomedical abstracts for K8 students by testing prompt engineering, a two-AI agent approach, and fine-tuning with OpenAI models, finding that prompt engineering with gpt-4o-mini outperformed other methods in qualitative metrics like simplicity and accuracy.

This paper describes our submissions to the TREC 2024 PLABA track with the aim to simplify biomedical abstracts for a K8-level audience (13-14 years old students). We tested three approaches using OpenAI's gpt-4o and gpt-4o-mini models: baseline prompt engineering, a two-AI agent approach, and fine-tuning. Adaptations were evaluated using qualitative metrics (5-point Likert scales for simplicity, accuracy, completeness, and brevity) and quantitative readability scores (Flesch-Kincaid grade level, SMOG Index). Results indicated that the two-agent approach and baseline prompt engineering with gpt-4o-mini models show superior qualitative performance, while fine-tuned models excelled in accuracy and completeness but were less simple. The evaluation results demonstrated that prompt engineering with gpt-4o-mini outperforms iterative improvement strategies via two-agent approach as well as fine-tuning with gpt-4o. We intend to expand our investigation of the results and explore advanced evaluations.

View on arXiv PDF

Similar