CLSep 26, 2024
Evaluation of Large Language Models for Summarization Tasks in the Medical Domain: A Narrative ReviewEmma Croxford, Yanjun Gao, Nicholas Pellegrino et al.
Large Language Models have advanced clinical Natural Language Generation, creating opportunities to manage the volume of medical text. However, the high-stakes nature of medicine requires reliable evaluation, which remains a challenge. In this narrative review, we assess the current evaluation state for clinical summarization tasks and propose future directions to address the resource constraints of expert human evaluation.