CLOct 11, 2024

MedMobile: A mobile-sized language model with clinical capabilities

arXiv:2410.09019v22 citationsh-index: 6BMJ Digital Health & AI
Originality Incremental advance
AI Analysis

This work addresses barriers to deploying language models in medicine, offering a mobile-sized solution that could democratize access, though it is incremental in adapting an existing model.

The authors tackled the problem of high computational costs and privacy concerns in medical language models by developing MedMobile, a 3.8 billion parameter model that runs on mobile devices, achieving 75.7% on MedQA (USMLE) and surpassing the passing mark for physicians.

Language models (LMs) have demonstrated expert-level reasoning and recall abilities in medicine. However, computational costs and privacy concerns are mounting barriers to wide-scale implementation. To address these significant limitations, we introduce a parsimonious adaptation of phi-3-mini, MedMobile, a 3.8 billion parameter LM capable of running on a mobile device, for medical applications. We perform a careful set of pipeline additions and demonstrate that chain of thought, ensembling, and fine-tuning lead to the greatest performance gains, while unexpectedly retrieval augmented generation fails to demonstrate significant improvements. We evaluate the efficiency of our pipeline on the MultiMedQA and MedBullets. We demonstrate that MedMobile scores 75.7% on the MedQA (USMLE), surpassing the passing mark for licensed physicians (~60%) and rivaling scores of models 100 times its size. Across the entirety of the MultiMedQA, MedMobile achieves SOTA performance for models with less than 5B parameters and represents the smallest model to pass the MedQA (USMLE). MedMobile holds promise to democratize access to language models in medicine, bolstering lower compute needs and fast inference speeds. With the ability to combat the biggest barriers to entry for language models in medicine, we hope that MedMobile is a critical step forward in developing clinically relevant language models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes