CLFeb 27, 2024

The Foundational Capabilities of Large Language Models in Predicting Postoperative Risks Using Clinical Notes

Charles Alba, Bing Xue, Joanna Abraham, Thomas Kannampallil, Chenyang Lu

arXiv:2402.17493v55.537 citationsh-index: 68Has Codenpj Digital Medicine

Originality Synthesis-oriented

AI Analysis

This addresses postoperative risk prediction for healthcare providers, but it is incremental as it applies existing LLM methods to a new clinical dataset.

The study tackled predicting postoperative risks from clinical notes using large language models (LLMs), achieving absolute improvements of up to 38.3% in AUROC and 33.2% in AUPRC over traditional methods, with further gains from fine-tuning strategies.

Clinical notes recorded during a patient's perioperative journey holds immense informational value. Advances in large language models (LLMs) offer opportunities for bridging this gap. Using 84,875 pre-operative notes and its associated surgical cases from 2018 to 2021, we examine the performance of LLMs in predicting six postoperative risks using various fine-tuning strategies. Pretrained LLMs outperformed traditional word embeddings by an absolute AUROC of 38.3% and AUPRC of 33.2%. Self-supervised fine-tuning further improved performance by 3.2% and 1.5%. Incorporating labels into training further increased AUROC by 1.8% and AUPRC by 2%. The highest performance was achieved with a unified foundation model, with improvements of 3.6% for AUROC and 2.6% for AUPRC compared to self-supervision, highlighting the foundational capabilities of LLMs in predicting postoperative risks, which could be potentially beneficial when deployed for perioperative care

View on arXiv PDF Code

Similar