CV · Nov 14, 2023

GPT-4V(ision) Unsuitable for Clinical Care and Education: A Clinician-Evaluated Assessment

arXiv:2403.12046v113 citations · h-index: 9
Originality: Synthesis-oriented
AI Analysis

The findings indicate that GPT-4V is currently unsuitable for clinical care and education, and they call for caution in medical applications.

The study evaluated GPT-4V's ability to interpret medical images like CT scans and MRIs, finding that its diagnostic accuracy and clinical decision-making were poor, posing risks to patient safety.

OpenAI's large multimodal model, GPT-4V(ision), was recently developed for general image interpretation. However, less is known about its capabilities with medical image interpretation and diagnosis. Board-certified physicians and senior residents assessed GPT-4V's proficiency across a range of medical conditions using imaging modalities such as CT scans, MRIs, ECGs, and clinical photographs. Although GPT-4V is able to identify and explain medical images, its diagnostic accuracy and clinical decision-making abilities are poor, posing risks to patient safety. Despite the potential that large language models may have in enhancing medical education and delivery, the current limitations of GPT-4V in interpreting medical images reinforce the importance of appropriate caution when using it for clinical decision-making.
