CVAILGApr 4, 2023

EPVT: Environment-aware Prompt Vision Transformer for Domain Generalization in Skin Lesion Recognition

arXiv:2304.01508v316 citationsh-index: 55Has Code
Originality Incremental advance
AI Analysis

This addresses domain generalization for skin lesion recognition, which is crucial for real-world deployment, but it is incremental as it builds on existing prompt-based and domain generalization techniques.

The paper tackles the problem of poor generalization in skin lesion recognition due to reliance on irrelevant image artifacts by proposing EPVT, a domain generalization method using prompts in vision transformers, which shows superior performance on out-of-distribution and biased datasets.

Skin lesion recognition using deep learning has made remarkable progress, and there is an increasing need for deploying these systems in real-world scenarios. However, recent research has revealed that deep neural networks for skin lesion recognition may overly depend on disease-irrelevant image artifacts (i.e., dark corners, dense hairs), leading to poor generalization in unseen environments. To address this issue, we propose a novel domain generalization method called EPVT, which involves embedding prompts into the vision transformer to collaboratively learn knowledge from diverse domains. Concretely, EPVT leverages a set of domain prompts, each of which plays as a domain expert, to capture domain-specific knowledge; and a shared prompt for general knowledge over the entire dataset. To facilitate knowledge sharing and the interaction of different prompts, we introduce a domain prompt generator that enables low-rank multiplicative updates between domain prompts and the shared prompt. A domain mixup strategy is additionally devised to reduce the co-occurring artifacts in each domain, which allows for more flexible decision margins and mitigates the issue of incorrectly assigned domain labels. Experiments on four out-of-distribution datasets and six different biased ISIC datasets demonstrate the superior generalization ability of EPVT in skin lesion recognition across various environments. Code is avaliable at https://github.com/SiyuanYan1/EPVT.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes