CLMar 26, 2025

Explainable ICD Coding via Entity Linking

arXiv:2503.20508v212 citationsh-index: 3Proceedings of the Second Workshop on Patient-Oriented Language Processing (CL4Health)
Originality Incremental advance
AI Analysis

This addresses the need for explainable coding in healthcare to support medical coders, though it is incremental as it builds on existing LLM methods.

The paper tackles the problem of providing explicit evidence for automated clinical coding by reframing it as an entity linking task, achieving effective disambiguation and strong performance in few-shot scenarios using parameter-efficient fine-tuning of LLMs with constrained decoding.

Clinical coding is a critical task in healthcare, although traditional methods for automating clinical coding may not provide sufficient explicit evidence for coders in production environments. This evidence is crucial, as medical coders have to make sure there exists at least one explicit passage in the input health record that justifies the attribution of a code. We therefore propose to reframe the task as an entity linking problem, in which each document is annotated with its set of codes and respective textual evidence, enabling better human-machine collaboration. By leveraging parameter-efficient fine-tuning of Large Language Models (LLMs), together with constrained decoding, we introduce three approaches to solve this problem that prove effective at disambiguating clinical mentions and that perform well in few-shot scenarios.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes