AICLCVHCMAMar 17, 2025

MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways

arXiv:2503.13205v19 citationsh-index: 5
Originality Highly original
AI Analysis

This work addresses the problem of complex inpatient pathway decision-making for clinicians, representing a novel method for a known bottleneck in medical AI.

The paper tackled the lack of AI systems for inpatient clinical decision-making by developing the IPDS benchmark from MIMIC-IV data and proposing the MAP multi-agent framework, which improved diagnosis accuracy by 25.10% over HuatuoGPT2-13B and outperformed board-certified clinicians by 10%-12% in clinical compliance.

Inpatient pathways demand complex clinical decision-making based on comprehensive patient information, posing critical challenges for clinicians. Despite advancements in large language models (LLMs) in medical applications, limited research focused on artificial intelligence (AI) inpatient pathways systems, due to the lack of large-scale inpatient datasets. Moreover, existing medical benchmarks typically concentrated on medical question-answering and examinations, ignoring the multifaceted nature of clinical decision-making in inpatient settings. To address these gaps, we first developed the Inpatient Pathway Decision Support (IPDS) benchmark from the MIMIC-IV database, encompassing 51,274 cases across nine triage departments and 17 major disease categories alongside 16 standardized treatment options. Then, we proposed the Multi-Agent Inpatient Pathways (MAP) framework to accomplish inpatient pathways with three clinical agents, including a triage agent managing the patient admission, a diagnosis agent serving as the primary decision maker at the department, and a treatment agent providing treatment plans. Additionally, our MAP framework includes a chief agent overseeing the inpatient pathways to guide and promote these three clinician agents. Extensive experiments showed our MAP improved the diagnosis accuracy by 25.10% compared to the state-of-the-art LLM HuatuoGPT2-13B. It is worth noting that our MAP demonstrated significant clinical compliance, outperforming three board-certified clinicians by 10%-12%, establishing a foundation for inpatient pathways systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes