CLFeb 27, 2024

The Foundational Capabilities of Large Language Models in Predicting Postoperative Risks Using Clinical Notes

arXiv:2402.17493v531 citationsh-index: 29npj Digital Medicine
Originality Synthesis-oriented
AI Analysis

This addresses postoperative risk prediction for healthcare providers, but it is incremental as it applies existing LLM methods to a new clinical dataset.

The study tackled predicting postoperative risks from clinical notes using large language models (LLMs), achieving absolute improvements of up to 38.3% in AUROC and 33.2% in AUPRC over traditional methods, with further gains from fine-tuning strategies.

Clinical notes recorded during a patient's perioperative journey holds immense informational value. Advances in large language models (LLMs) offer opportunities for bridging this gap. Using 84,875 pre-operative notes and its associated surgical cases from 2018 to 2021, we examine the performance of LLMs in predicting six postoperative risks using various fine-tuning strategies. Pretrained LLMs outperformed traditional word embeddings by an absolute AUROC of 38.3% and AUPRC of 33.2%. Self-supervised fine-tuning further improved performance by 3.2% and 1.5%. Incorporating labels into training further increased AUROC by 1.8% and AUPRC by 2%. The highest performance was achieved with a unified foundation model, with improvements of 3.6% for AUROC and 2.6% for AUPRC compared to self-supervision, highlighting the foundational capabilities of LLMs in predicting postoperative risks, which could be potentially beneficial when deployed for perioperative care

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes