TO LGOct 1, 2025

Neu-RadBERT for Enhanced Diagnosis of Brain Injuries and Conditions

Manpreet Singh, Sean Macrae, Pierre-Marc Williams, Nicole Hung, Sabrina Araujo de Franca, Laurent Letourneau-Guillon, François-Martin Carrier, Bang Liu, Yiorgos Alexandros Cavayas

arXiv:2510.06232v11.2h-index: 16

Originality Incremental advance

AI Analysis

This work addresses the need for automated diagnosis extraction from radiology reports in healthcare, offering a domain-specific tool for research and patient care, though it is incremental as it builds on existing BERT-based methods.

The researchers tackled the problem of extracting diagnoses from free-text radiology reports for brain injuries in patients with acute respiratory failure, developing Neu-RadBERT, which achieved up to 98.0% accuracy for acute brain injuries, outperforming baseline models and Llama-2-13B at 67.5%.

Objective: We sought to develop a classification algorithm to extract diagnoses from free-text radiology reports of brain imaging performed in patients with acute respiratory failure (ARF) undergoing invasive mechanical ventilation. Methods: We developed and fine-tuned Neu-RadBERT, a BERT-based model, to classify unstructured radiology reports. We extracted all the brain imaging reports (computed tomography and magnetic resonance imaging) from MIMIC-IV database, performed in patients with ARF. Initial manual labelling was performed on a subset of reports for various brain abnormalities, followed by fine-tuning Neu-RadBERT using three strategies: 1) baseline RadBERT, 2) Neu-RadBERT with Masked Language Modeling (MLM) pretraining, and 3) Neu-RadBERT with MLM pretraining and oversampling to address data skewness. We compared the performance of this model to Llama-2-13B, an autoregressive LLM. Results: The Neu-RadBERT model, particularly with oversampling, demonstrated significant improvements in diagnostic accuracy compared to baseline RadBERT for brain abnormalities, achieving up to 98.0% accuracy for acute brain injuries. Llama-2-13B exhibited relatively lower performance, peaking at 67.5% binary classification accuracy. This result highlights potential limitations of current autoregressive LLMs for this specific classification task, though it remains possible that larger models or further fine-tuning could improve performance. Conclusion: Neu-RadBERT, enhanced through target domain pretraining and oversampling techniques, offered a robust tool for accurate and reliable diagnosis of neurological conditions from radiology reports. This study underscores the potential of transformer-based NLP models in automatically extracting diagnoses from free text reports with potential applications to both research and patient care.

View on arXiv PDF

Similar