LGCLAug 15, 2022

Entity Anchored ICD Coding

arXiv:2208.07444v17 citationsh-index: 15
Originality Incremental advance
AI Analysis

This addresses the problem of efficient and accurate medical coding for healthcare systems, though it appears incremental as it builds on existing NLP approaches with specific modifications.

The paper tackles the challenge of assigning ICD codes to medical notes by limiting model inputs to windows around medical entities and building contextualized representations of both codes and entities, achieving superior performance on rare and unseen codes compared to existing methods.

Medical coding is a complex task, requiring assignment of a subset of over 72,000 ICD codes to a patient's notes. Modern natural language processing approaches to these tasks have been challenged by the length of the input and size of the output space. We limit our model inputs to a small window around medical entities found in our documents. From those local contexts, we build contextualized representations of both ICD codes and entities, and aggregate over these representations to form document-level predictions. In contrast to existing methods which use a representation fixed either in size or by codes seen in training, we represent ICD codes by encoding the code description with local context. We discuss metrics appropriate to deploying coding systems in practice. We show that our approach is superior to existing methods in both standard and deployable measures, including performance on rare and unseen codes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes