AIIVNov 12, 2024

Leveraging Multimodal Models for Enhanced Neuroimaging Diagnostics in Alzheimer's Disease

arXiv:2411.07871v15 citationsh-index: 1BigData
Originality Synthesis-oriented
AI Analysis

This addresses a gap in neuroimaging diagnostics for Alzheimer's disease, though it is incremental as it applies existing multimodal models to a new medical domain.

The paper tackled the lack of diagnostic reports for Alzheimer's disease neuroimaging by generating synthetic reports using GPT-4o-mini on the OASIS-4 dataset, and then using these to train models that generated reports from images, achieving BLEU-4, ROUGE-L, and METEOR scores of 0.1827, 0.3719, and 0.4163 respectively.

The rapid advancements in Large Language Models (LLMs) and Vision-Language Models (VLMs) have shown great potential in medical diagnostics, particularly in radiology, where datasets such as X-rays are paired with human-generated diagnostic reports. However, a significant research gap exists in the neuroimaging field, especially for conditions such as Alzheimer's disease, due to the lack of comprehensive diagnostic reports that can be utilized for model fine-tuning. This paper addresses this gap by generating synthetic diagnostic reports using GPT-4o-mini on structured data from the OASIS-4 dataset, which comprises 663 patients. Using the synthetic reports as ground truth for training and validation, we then generated neurological reports directly from the images in the dataset leveraging the pre-trained BiomedCLIP and T5 models. Our proposed method achieved a BLEU-4 score of 0.1827, ROUGE-L score of 0.3719, and METEOR score of 0.4163, revealing its potential in generating clinically relevant and accurate diagnostic reports.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes