CVAIAug 26, 2022

Segmentation of Parotid Gland Tumors Using Multimodal MRI and Contrastive Learning

arXiv:2208.12413v2h-index: 10
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of accurate tumor segmentation for medical professionals in head and neck oncology, but it is incremental as it applies existing contrastive learning techniques to a specific domain.

The paper tackled the challenging segmentation of parotid gland tumors from multimodal MRI by using a Transformer-based contrastive learning method with transfer learning, resulting in improved performance with metrics like DSC of 89.60% and HD of 2.98 compared to supervised learning baselines.

Parotid gland tumor is a common type of head and neck tumor. Segmentation of the parotid glands and tumors by MR images is important for the treatment of parotid gland tumors. However, segmentation of the parotid glands is particularly challenging due to their variable shape and low contrast with surrounding structures. Recently deep learning has developed rapidly, which can handle complex problems. However, most of the current deep learning methods for processing medical images are still based on supervised learning. Compared with natural images, medical images are difficult to acquire and costly to label. Contrastive learning, as an unsupervised learning method, can more effectively utilize unlabeled medical images. In this paper, we used a Transformer-based contrastive learning method and innovatively trained the contrastive learning network with transfer learning. Then, the output model was transferred to the downstream parotid segmentation task, which improved the performance of the parotid segmentation model on the test set. The improved DSC was 89.60%, MPA was 99.36%, MIoU was 85.11%, and HD was 2.98. All four metrics showed significant improvement compared to the results of using a supervised learning model as a pre-trained model for the parotid segmentation network. In addition, we found that the improvement of the segmentation network by the contrastive learning model was mainly in the encoder part, so this paper also tried to build a contrastive learning network for the decoder part and discussed the problems encountered in the process of building.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes