CLNov 13, 2019

Adapting and evaluating a deep learning language model for clinical why-question answering

Andrew Wen, Mohamed Y. Elwazir, Sungrim Moon, Jungwei Fan

arXiv:1911.05604v135 citations

Originality Synthesis-oriented

AI Analysis

This addresses clinical information extraction for healthcare professionals, but it is incremental as it applies an existing method to a new domain.

The study tackled the problem of answering why-questions from clinical text by adapting a BERT model, achieving an accuracy of 0.707 (or 0.760 with partial match) and a 6% improvement from clinical language customization.

Objectives: To adapt and evaluate a deep learning language model for answering why-questions based on patient-specific clinical text. Materials and Methods: Bidirectional encoder representations from transformers (BERT) models were trained with varying data sources to perform SQuAD 2.0 style why-question answering (why-QA) on clinical notes. The evaluation focused on: 1) comparing the merits from different training data, 2) error analysis. Results: The best model achieved an accuracy of 0.707 (or 0.760 by partial match). Training toward customization for the clinical language helped increase 6% in accuracy. Discussion: The error analysis suggested that the model did not really perform deep reasoning and that clinical why-QA might warrant more sophisticated solutions. Conclusion: The BERT model achieved moderate accuracy in clinical why-QA and should benefit from the rapidly evolving technology. Despite the identified limitations, it could serve as a competent proxy for question-driven clinical information extraction.

View on arXiv PDF

Similar