LGAICLApr 21

Statistics, Not Scale: Modular Medical Dialogue with Bayesian Belief Engine

arXiv:2604.2002288.6h-index: 22
AI Analysis

This addresses the need for more reliable and auditable diagnostic tools in healthcare, offering a novel architectural solution rather than an incremental improvement.

The paper tackles the problem of conflating natural-language communication and probabilistic reasoning in LLMs used as autonomous diagnostic agents by introducing BMBE, a modular framework that separates these capabilities, resulting in improved privacy, cost-effectiveness, and robustness compared to frontier LLMs.

Large language models are increasingly deployed as autonomous diagnostic agents, yet they conflate two fundamentally different capabilities: natural-language communication and probabilistic reasoning. We argue that this conflation is an architectural flaw, not an engineering shortcoming. We introduce BMBE (Bayesian Medical Belief Engine), a modular diagnostic dialogue framework that enforces a strict separation between language and reasoning: an LLM serves only as a sensor, parsing patient utterances into structured evidence and verbalising questions, while all diagnostic inference resides in a deterministic, auditable Bayesian engine. Because patient data never enters the LLM, the architecture is private by construction; because the statistical backend is a standalone module, it can be replaced per target population without retraining. This separation yields three properties no autonomous LLM can offer: calibrated selective diagnosis with a continuously adjustable accuracy-coverage tradeoff, a statistical separation gap where even a cheap sensor paired with the engine outperforms a frontier standalone model from the same family at a fraction of the cost, and robustness to adversarial patient communication styles that cause standalone doctors to collapse. We validate across empirical and LLM-generated knowledge bases against frontier LLMs, confirming the advantage is architectural, not informational.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes