CVAIIVMay 21, 2025

Domain Adaptive Skin Lesion Classification via Conformal Ensemble of Vision Transformers

arXiv:2505.15997v13 citationsh-index: 6ACL
Originality Incremental advance
AI Analysis

This addresses the need for trustworthy medical imaging systems by enhancing domain adaptation and robustness, though it is incremental as it builds on existing conformal prediction and ensemble methods.

The paper tackled the problem of unreliable uncertainty estimates in deep learning models for skin lesion classification under domain shifts, and the result was a framework that achieved a 90.38% coverage rate, improving by 9.95% over a baseline model.

Exploring the trustworthiness of deep learning models is crucial, especially in critical domains such as medical imaging decision support systems. Conformal prediction has emerged as a rigorous means of providing deep learning models with reliable uncertainty estimates and safety guarantees. However, conformal prediction results face challenges due to the backbone model's struggles in domain-shifted scenarios, such as variations in different sources. To aim this challenge, this paper proposes a novel framework termed Conformal Ensemble of Vision Transformers (CE-ViTs) designed to enhance image classification performance by prioritizing domain adaptation and model robustness, while accounting for uncertainty. The proposed method leverages an ensemble of vision transformer models in the backbone, trained on diverse datasets including HAM10000, Dermofit, and Skin Cancer ISIC datasets. This ensemble learning approach, calibrated through the combined mentioned datasets, aims to enhance domain adaptation through conformal learning. Experimental results underscore that the framework achieves a high coverage rate of 90.38\%, representing an improvement of 9.95\% compared to the HAM10000 model. This indicates a strong likelihood that the prediction set includes the true label compared to singular models. Ensemble learning in CE-ViTs significantly improves conformal prediction performance, increasing the average prediction set size for challenging misclassified samples from 1.86 to 3.075.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes