CLLGMar 25

Dialogue to Question Generation for Evidence-based Medical Guideline Agent Development

arXiv:2603.2393721.7h-index: 3
AI Analysis

This addresses the problem of integrating evidence-based guidelines into brief consultations for physicians, though it is incremental as it builds on existing LLM capabilities.

The study tackled the challenge of implementing evidence-based medicine in fast-paced primary care by using large language models to generate guideline-relevant questions during physician-patient encounters, finding that while not fully reliable, they can produce clinically meaningful questions to reduce cognitive burden.

Evidence-based medicine (EBM) is central to high-quality care, but remains difficult to implement in fast-paced primary care settings. Physicians face short consultations, increasing patient loads, and lengthy guideline documents that are impractical to consult in real time. To address this gap, we investigate the feasibility of using large language models (LLMs) as ambient assistants that surface targeted, evidence-based questions during physician-patient encounters. Our study focuses on question generation rather than question answering, with the aim of scaffolding physician reasoning and integrating guideline-based practice into brief consultations. We implemented two prompting strategies, a zero-shot baseline and a multi-stage reasoning variant, using Gemini 2.5 as the backbone model. We evaluated on a benchmark of 80 de-identified transcripts from real clinical encounters, with six experienced physicians contributing over 90 hours of structured review. Results indicate that while general-purpose LLMs are not yet fully reliable, they can produce clinically meaningful and guideline-relevant questions, suggesting significant potential to reduce cognitive burden and make EBM more actionable at the point of care.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes