AIDec 15, 2025

MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph

arXiv:2512.13510v12 citationsh-index: 11Has Code
Originality Incremental advance
AI Analysis

This addresses the need for transparent and reliable AI reasoning in clinical decision-making for physicians, representing a solid but incremental advancement in medical AI.

The paper tackles the problem of limited clinical reliability in medical reasoning by large language models, proposing MedCEG to reinforce verifiable reasoning with a Critical Evidence Graph, resulting in performance surpassing existing methods while producing clinically valid reasoning chains.

Large language models with reasoning capabilities have demonstrated impressive performance across a wide range of domains. In clinical applications, a transparent, step-by-step reasoning process provides physicians with strong evidence to support decision-making. While reinforcement learning has effectively enhanced reasoning performance in medical contexts, the clinical reliability of these reasoning processes remains limited because their accuracy and validity are often overlooked during training. To address this gap, we propose MedCEG, a framework that augments medical language models with clinically valid reasoning pathways by explicitly supervising the reasoning process through a Critical Evidence Graph (CEG). We curate a dataset of challenging clinical cases and algorithmically construct a CEG for each sample to represent a high-quality verifiable reasoning pathway. To guide the reasoning process, we introduce a Clinical Reasoning Procedure Reward, which evaluates Node Coverage, Structural Correctness, and Chain Completeness, thereby providing a holistic assessment of reasoning quality. Experimental results show that MedCEG surpasses existing methods in performance while producing clinically valid reasoning chains, representing a solid advancement in reliable medical AI reasoning. The code and models are available at https://github.com/LinjieMu/MedCEG.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes