CVAug 30, 2024

BTMuda: A Bi-level Multi-source unsupervised domain adaptation framework for breast cancer diagnosis

Yuxiang Yang, Xinyi Zeng, Pinxian Zeng, Binyu Yan, Xi Wu, Jiliu Zhou, Yan Wang

arXiv:2408.17054v12.06 citationsh-index: 9

Originality Incremental advance

AI Analysis

This work addresses domain adaptation challenges for breast cancer diagnosis, which is an incremental improvement over existing methods by incorporating multiple sources and bi-level shift handling.

The paper tackles domain shift problems in breast cancer diagnosis by proposing BTMuda, a bi-level multi-source unsupervised domain adaptation framework that addresses intra-domain and inter-domain shifts using a three-branch mixed extractor with CNN and Transformer components. The method achieves state-of-the-art performance on three public mammographic datasets.

Deep learning has revolutionized the early detection of breast cancer, resulting in a significant decrease in mortality rates. However, difficulties in obtaining annotations and huge variations in distribution between training sets and real scenes have limited their clinical applications. To address these limitations, unsupervised domain adaptation (UDA) methods have been used to transfer knowledge from one labeled source domain to the unlabeled target domain, yet these approaches suffer from severe domain shift issues and often ignore the potential benefits of leveraging multiple relevant sources in practical applications. To address these limitations, in this work, we construct a Three-Branch Mixed extractor and propose a Bi-level Multi-source unsupervised domain adaptation method called BTMuda for breast cancer diagnosis. Our method addresses the problems of domain shift by dividing domain shift issues into two levels: intra-domain and inter-domain. To reduce the intra-domain shift, we jointly train a CNN and a Transformer as two paths of a domain mixed feature extractor to obtain robust representations rich in both low-level local and high-level global information. As for the inter-domain shift, we redesign the Transformer delicately to a three-branch architecture with cross-attention and distillation, which learns domain-invariant representations from multiple domains. Besides, we introduce two alignment modules - one for feature alignment and one for classifier alignment - to improve the alignment process. Extensive experiments conducted on three public mammographic datasets demonstrate that our BTMuda outperforms state-of-the-art methods.

View on arXiv PDF

Similar