CVAIAug 25, 2025

Hierarchical Vision-Language Learning for Medical Out-of-Distribution Detection

arXiv:2508.17667v1h-index: 6Has CodeMICCAI
Originality Incremental advance
AI Analysis

This addresses the risk of misdiagnosis in medical diagnosis systems by improving detection of unknown diseases, though it appears incremental as it builds on existing vision-language models.

The study tackled the problem of detecting unknown diseases in medical images by proposing a hierarchical vision-language learning framework, achieving superior out-of-distribution detection performance on three public datasets compared to existing methods.

In trustworthy medical diagnosis systems, integrating out-of-distribution (OOD) detection aims to identify unknown diseases in samples, thereby mitigating the risk of misdiagnosis. In this study, we propose a novel OOD detection framework based on vision-language models (VLMs), which integrates hierarchical visual information to cope with challenging unknown diseases that resemble known diseases. Specifically, a cross-scale visual fusion strategy is proposed to couple visual embeddings from multiple scales. This enriches the detailed representation of medical images and thus improves the discrimination of unknown diseases. Moreover, a cross-scale hard pseudo-OOD sample generation strategy is proposed to benefit OOD detection maximally. Experimental evaluations on three public medical datasets support that the proposed framework achieves superior OOD detection performance compared to existing methods. The source code is available at https://openi.pcl.ac.cn/OpenMedIA/HVL.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes