CVJun 20, 2025

Trustworthy Few-Shot Transfer of Medical VLMs through Split Conformal Prediction

arXiv:2506.17503v13 citationsh-index: 50MICCAI
Originality Incremental advance
AI Analysis

This work addresses reliability issues in medical AI applications, offering a domain-specific solution for trustworthy few-shot transfer learning.

The paper tackles the problem of ensuring trustworthiness in few-shot transfer of medical vision-language models by proposing a novel pipeline, transductive split conformal adaptation (SCA-T), which improves efficiency and conditional coverage compared to standard split conformal prediction while maintaining empirical guarantees.

Medical vision-language models (VLMs) have demonstrated unprecedented transfer capabilities and are being increasingly adopted for data-efficient image classification. Despite its growing popularity, its reliability aspect remains largely unexplored. This work explores the split conformal prediction (SCP) framework to provide trustworthiness guarantees when transferring such models based on a small labeled calibration set. Despite its potential, the generalist nature of the VLMs' pre-training could negatively affect the properties of the predicted conformal sets for specific tasks. While common practice in transfer learning for discriminative purposes involves an adaptation stage, we observe that deploying such a solution for conformal purposes is suboptimal since adapting the model using the available calibration data breaks the rigid exchangeability assumptions for test data in SCP. To address this issue, we propose transductive split conformal adaptation (SCA-T), a novel pipeline for transfer learning on conformal scenarios, which performs an unsupervised transductive adaptation jointly on calibration and test data. We present comprehensive experiments utilizing medical VLMs across various image modalities, transfer tasks, and non-conformity scores. Our framework offers consistent gains in efficiency and conditional coverage compared to SCP, maintaining the same empirical guarantees.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes