CL AIApr 3, 2025

Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models

Chengyang He, Wenlong Zhang, Violet Xinying Chen, Yue Ning, Ping Wang

arXiv:2504.03051v12.7h-index: 22Has CodeCHASE

Originality Incremental advance

AI Analysis

This work addresses a critical need in pharmacovigilance and safety monitoring by improving symptom coding from vaccine reports, though it is incremental as it builds on existing LLM methods for a specific domain.

The paper tackles the problem of accurate medical symptom coding from unstructured clinical text by proposing TACO Prompting, a framework that unifies extraction and linking tasks using LLMs, and introduces the SYMPCODER dataset; evaluation across multiple LLMs shows improved flexibility and accuracy for this tailored task.

Accurate medical symptom coding from unstructured clinical text, such as vaccine safety reports, is a critical task with applications in pharmacovigilance and safety monitoring. Symptom coding, as tailored in this study, involves identifying and linking nuanced symptom mentions to standardized vocabularies like MedDRA, differentiating it from broader medical coding tasks. Traditional approaches to this task, which treat symptom extraction and linking as independent workflows, often fail to handle the variability and complexity of clinical narratives, especially for rare cases. Recent advancements in Large Language Models (LLMs) offer new opportunities but face challenges in achieving consistent performance. To address these issues, we propose Task as Context (TACO) Prompting, a novel framework that unifies extraction and linking tasks by embedding task-specific context into LLM prompts. Our study also introduces SYMPCODER, a human-annotated dataset derived from Vaccine Adverse Event Reporting System (VAERS) reports, and a two-stage evaluation framework to comprehensively assess both symptom linking and mention fidelity. Our comprehensive evaluation of multiple LLMs, including Llama2-chat, Jackalope-7b, GPT-3.5 Turbo, GPT-4 Turbo, and GPT-4o, demonstrates TACO's effectiveness in improving flexibility and accuracy for tailored tasks like symptom coding, paving the way for more specific coding tasks and advancing clinical text processing methodologies.

View on arXiv PDF Code

Similar