CLLGMLJun 4, 2021

CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes

arXiv:2106.02524v1716 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of information loss in clinical settings for physicians and caregivers, but it is incremental as it builds on existing datasets and methods.

The authors tackled the problem of extracting action items from lengthy hospital discharge notes to improve continuity of care by creating CLIP, a dataset of 718 documents with physician annotations covering 100K sentences, and showed that models using in-domain pre-training and contextual information achieved the best performance.

Continuity of care is crucial to ensuring positive health outcomes for patients discharged from an inpatient hospital setting, and improved information sharing can help. To share information, caregivers write discharge notes containing action items to share with patients and their future caregivers, but these action items are easily lost due to the lengthiness of the documents. In this work, we describe our creation of a dataset of clinical action items annotated over MIMIC-III, the largest publicly available dataset of real clinical notes. This dataset, which we call CLIP, is annotated by physicians and covers 718 documents representing 100K sentences. We describe the task of extracting the action items from these documents as multi-aspect extractive summarization, with each aspect representing a type of action to be taken. We evaluate several machine learning models on this task, and show that the best models exploit in-domain language model pre-training on 59K unannotated documents, and incorporate context from neighboring sentences. We also propose an approach to pre-training data selection that allows us to explore the trade-off between size and domain-specificity of pre-training datasets for this task.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes