CLAICVLGAug 1, 2025

Medical Reasoning in the Era of LLMs: A Systematic Review of Enhancement Techniques and Applications

arXiv:2508.00669v112 citationsh-index: 8
Originality Synthesis-oriented
AI Analysis

It addresses the gap in systematic reasoning for LLMs in medicine, which is crucial for clinical practice, but is incremental as a review rather than a novel method.

This paper provides the first systematic review of techniques to enhance large language models (LLMs) for medical reasoning, analyzing 60 studies from 2022-2025 to categorize methods and applications in areas like diagnosis and treatment planning.

The proliferation of Large Language Models (LLMs) in medicine has enabled impressive capabilities, yet a critical gap remains in their ability to perform systematic, transparent, and verifiable reasoning, a cornerstone of clinical practice. This has catalyzed a shift from single-step answer generation to the development of LLMs explicitly designed for medical reasoning. This paper provides the first systematic review of this emerging field. We propose a taxonomy of reasoning enhancement techniques, categorized into training-time strategies (e.g., supervised fine-tuning, reinforcement learning) and test-time mechanisms (e.g., prompt engineering, multi-agent systems). We analyze how these techniques are applied across different data modalities (text, image, code) and in key clinical applications such as diagnosis, education, and treatment planning. Furthermore, we survey the evolution of evaluation benchmarks from simple accuracy metrics to sophisticated assessments of reasoning quality and visual interpretability. Based on an analysis of 60 seminal studies from 2022-2025, we conclude by identifying critical challenges, including the faithfulness-plausibility gap and the need for native multimodal reasoning, and outlining future directions toward building efficient, robust, and sociotechnically responsible medical AI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes