CVSPJun 26, 2025

MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification

arXiv:2506.21199v1h-index: 1
Originality Incremental advance
AI Analysis

This addresses the need for scalable, user-friendly medical imaging systems for clinicians and researchers, though it appears incremental as it builds on existing LLM and CNN components.

The paper tackles the problem of inflexible, task-specific medical image analysis systems by introducing MedPrompt, a unified framework that combines an LLM for task planning with modular CNNs for image processing, achieving 97% correctness in prompt-driven execution and competitive performance on segmentation and classification tasks.

Current medical image analysis systems are typically task-specific, requiring separate models for classification and segmentation, and lack the flexibility to support user-defined workflows. To address these challenges, we introduce MedPrompt, a unified framework that combines a few-shot prompted Large Language Model (Llama-4-17B) for high-level task planning with a modular Convolutional Neural Network (DeepFusionLab) for low-level image processing. The LLM interprets user instructions and generates structured output to dynamically route task-specific pretrained weights. This weight routing approach avoids retraining the entire framework when adding new tasks-only task-specific weights are required, enhancing scalability and deployment. We evaluated MedPrompt across 19 public datasets, covering 12 tasks spanning 5 imaging modalities. The system achieves a 97% end-to-end correctness in interpreting and executing prompt-driven instructions, with an average inference latency of 2.5 seconds, making it suitable for near real-time applications. DeepFusionLab achieves competitive segmentation accuracy (e.g., Dice 0.9856 on lungs) and strong classification performance (F1 0.9744 on tuberculosis). Overall, MedPrompt enables scalable, prompt-driven medical imaging by combining the interpretability of LLMs with the efficiency of modular CNNs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes