Multi-Stage Patient Role-Playing Framework for Realistic Clinical Interactions
This work addresses the need for more authentic and diverse doctor-patient interactions in clinical LLMs and medical education, though it is incremental as it builds on existing simulation methods.
The authors tackled the problem of unrealistic clinical interactions in LLMs by creating a Chinese patient simulation dataset and proposing a multi-stage framework, which significantly improved model performance in emulating patient behavior.
The simulation of realistic clinical interactions plays a pivotal role in advancing clinical Large Language Models (LLMs) and supporting medical diagnostic education. Existing approaches and benchmarks rely on generic or LLM-generated dialogue data, which limits the authenticity and diversity of doctor-patient interactions. In this work, we propose the first Chinese patient simulation dataset (Ch-PatientSim), constructed from realistic clinical interaction scenarios to comprehensively evaluate the performance of models in emulating patient behavior. Patients are simulated based on a five-dimensional persona structure. To address issues of the persona class imbalance, a portion of the dataset is augmented using few-shot generation, followed by manual verification. We evaluate various state-of-the-art LLMs and find that most produce overly formal responses that lack individual personality. To address this limitation, we propose a training-free Multi-Stage Patient Role-Playing (MSPRP) framework, which decomposes interactions into three stages to ensure both personalization and realism in model responses. Experimental results demonstrate that our approach significantly improves model performance across multiple dimensions of patient simulation.