CLLGOct 29, 2020

Multiple Sclerosis Severity Classification From Clinical Text

arXiv:2010.15316v1994 citations
Originality Highly original
AI Analysis

This work addresses the challenge of automating MS severity scoring from clinical notes, which typically requires expert interpretation, by providing a more accurate and efficient solution for healthcare professionals.

The researchers tackled the problem of automatically classifying Multiple Sclerosis severity from unstructured clinical text, achieving state-of-the-art performance with a Macro-F1 of 0.88 for EDSS prediction and an average improvement of 0.29 to 0.63 for functional subscores.

Multiple Sclerosis (MS) is a chronic, inflammatory and degenerative neurological disease, which is monitored by a specialist using the Expanded Disability Status Scale (EDSS) and recorded in unstructured text in the form of a neurology consult note. An EDSS measurement contains an overall "EDSS" score and several functional subscores. Typically, expert knowledge is required to interpret consult notes and generate these scores. Previous approaches used limited context length Word2Vec embeddings and keyword searches to predict scores given a consult note, but often failed when scores were not explicitly stated. In this work, we present MS-BERT, the first publicly available transformer model trained on real clinical data other than MIMIC. Next, we present MSBC, a classifier that applies MS-BERT to generate embeddings and predict EDSS and functional subscores. Lastly, we explore combining MSBC with other models through the use of Snorkel to generate scores for unlabelled consult notes. MSBC achieves state-of-the-art performance on all metrics and prediction tasks and outperforms the models generated from the Snorkel ensemble. We improve Macro-F1 by 0.12 (to 0.88) for predicting EDSS and on average by 0.29 (to 0.63) for predicting functional subscores over previous Word2Vec CNN and rule-based approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes