CVAIMar 19

Mind the Rarities: Can Rare Skin Diseases Be Reliably Diagnosed via Diagnostic Reasoning?

arXiv:2603.1841899.1h-index: 7
Predicted impact top 1% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses a gap in reliable diagnosis for rare dermatological conditions, which is critical for clinical applications but incremental as it builds on existing benchmarks.

The authors tackled the problem of evaluating diagnostic reasoning for rare skin diseases in large vision-language models by constructing DermCase, a benchmark with 26,030 multi-modal pairs and 6,354 challenging cases, revealing significant deficiencies in current models' accuracy and reasoning.

Large vision-language models (LVLMs) demonstrate strong performance in dermatology; however, evaluating diagnostic reasoning for rare conditions remains largely unexplored. Existing benchmarks focus on common diseases and assess only final accuracy, overlooking the clinical reasoning process, which is critical for complex cases. We address this gap by constructing DermCase, a long-context benchmark derived from peer-reviewed case reports. Our dataset contains 26,030 multi-modal image-text pairs and 6,354 clinically challenging cases, each annotated with comprehensive clinical information and step-by-step reasoning chains. To enable reliable evaluation, we establish DermLIP-based similarity metrics that achieve stronger alignment with dermatologists for assessing differential diagnosis quality. Benchmarking 22 leading LVLMs exposes significant deficiencies across diagnosis accuracy, differential diagnosis, and clinical reasoning. Fine-tuning experiments demonstrate that instruction tuning substantially improves performance while Direct Preference Optimization (DPO) yields minimal gains. Systematic error analysis further reveals critical limitations in current models' reasoning capabilities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes