CV AIAug 5, 2023

Semi-supervised Learning for Segmentation of Bleeding Regions in Video Capsule Endoscopy

Hechen Li, Yanan Wu, Long Bai, An Wang, Tong Chen, Hongliang Ren

arXiv:2308.02869v15.96 citationsh-index: 67

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of costly and time-consuming annotation for gastrointestinal bleeding diagnosis, though it appears incremental as it applies an existing SSL method to a specific medical domain.

The paper tackled the problem of reducing the need for large annotated datasets in segmenting bleeding regions in video capsule endoscopy images by using a semi-supervised learning approach, achieving accurate identification without compromising performance.

In the realm of modern diagnostic technology, video capsule endoscopy (VCE) is a standout for its high efficacy and non-invasive nature in diagnosing various gastrointestinal (GI) conditions, including obscure bleeding. Importantly, for the successful diagnosis and treatment of these conditions, accurate recognition of bleeding regions in VCE images is crucial. While deep learning-based methods have emerged as powerful tools for the automated analysis of VCE images, they often demand large training datasets with comprehensive annotations. Acquiring these labeled datasets tends to be time-consuming, costly, and requires significant domain expertise. To mitigate this issue, we have embraced a semi-supervised learning (SSL) approach for the bleeding regions segmentation within VCE. By adopting the `Mean Teacher' method, we construct a student U-Net equipped with an scSE attention block, alongside a teacher model of the same architecture. These models' parameters are alternately updated throughout the training process. We use the Kvasir-Capsule dataset for our experiments, which encompasses various GI bleeding conditions. Notably, we develop the segmentation annotations for this dataset ourselves. The findings from our experiments endorse the efficacy of the SSL-based segmentation strategy, demonstrating its capacity to reduce reliance on large volumes of annotations for model training, without compromising on the accuracy of identification.

View on arXiv PDF

Similar