CV AI IV SPJan 25, 2022

Virtual Adversarial Training for Semi-supervised Breast Mass Classification

Xuxin Chen, Ximin Wang, Ke Zhang, Kar-Ming Fung, Theresa C. Thai, Kathleen Moore, Robert S. Mannel, Hong Liu, Bin Zheng, Yuchen Qiu

arXiv:2201.10675v11.46 citations

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of limited labeled data for medical image analysis, specifically for breast mass classification, by leveraging unlabeled data to improve accuracy, though it is incremental as it applies an existing method to a new domain.

The study tackled breast mass classification in mammograms by using virtual adversarial training (VAT) in a semi-supervised learning approach, achieving classification accuracies of 0.740 and 0.760 with 40% and 80% labeled data, respectively.

This study aims to develop a novel computer-aided diagnosis (CAD) scheme for mammographic breast mass classification using semi-supervised learning. Although supervised deep learning has achieved huge success across various medical image analysis tasks, its success relies on large amounts of high-quality annotations, which can be challenging to acquire in practice. To overcome this limitation, we propose employing a semi-supervised method, i.e., virtual adversarial training (VAT), to leverage and learn useful information underlying in unlabeled data for better classification of breast masses. Accordingly, our VAT-based models have two types of losses, namely supervised and virtual adversarial losses. The former loss acts as in supervised classification, while the latter loss aims at enhancing model robustness against virtual adversarial perturbation, thus improving model generalizability. To evaluate the performance of our VAT-based CAD scheme, we retrospectively assembled a total of 1024 breast mass images, with equal number of benign and malignant masses. A large CNN and a small CNN were used in this investigation, and both were trained with and without the adversarial loss. When the labeled ratios were 40% and 80%, VAT-based CNNs delivered the highest classification accuracy of 0.740 and 0.760, respectively. The experimental results suggest that the VAT-based CAD scheme can effectively utilize meaningful knowledge from unlabeled data to better classify mammographic breast mass images.

View on arXiv PDF

Similar