CLOct 30, 2019

Phenotyping of Clinical Notes with Improved Document Classification Models Using Contextualized Neural Language Models

arXiv:1910.13664v239 citations
AI Analysis

This work addresses phenotyping from clinical notes for healthcare applications, representing an incremental improvement by applying existing language models to a specific domain.

The paper tackled the problem of extracting patient conditions from clinical notes by exploring BERT-based architectures, which removed the need for manual engineering and achieved competitive or superior performance compared to state-of-the-art methods on two phenotyping tasks.

Clinical notes contain an extensive record of a patient's health status, such as smoking status or the presence of heart conditions. However, this detail is not replicated within the structured data of electronic health systems. Phenotyping, the extraction of patient conditions from free clinical text, is a critical task which supports avariety of downstream applications such as decision support and secondary use of medical records. Previous work has resulted in systems which are high performing but require hand engineering, often of rules. Recent work in pretrained contextualized language models have enabled advances in representing text for a variety of tasks. We therefore explore several architectures for modeling pheno-typing that rely solely on BERT representations of the clinical note, removing the need for manual engineering. We find these architectures are competitive with or outperform existing state of the art methods on two phenotyping tasks.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes