CVOct 18, 2024

Tackling domain generalization for out-of-distribution endoscopic imaging

Mansoor Ali Teevno, Gilberto Ochoa-Ruiz, Sharib Ali

arXiv:2410.14821v12.0h-index: 4MLMI@MICCAI

Originality Incremental advance

AI Analysis

This addresses the problem of poor generalization in deep learning for surgical scene segmentation across different endoscopic modalities, which is a domain-specific incremental advance.

The paper tackles domain generalization for out-of-distribution endoscopic imaging by proposing a method that uses style and content information with a restitution module, achieving improvements of up to 13.7% over baselines and 8% over SOTA on polyp and Barrett's esophagus datasets.

While recent advances in deep learning (DL) for surgical scene segmentation have yielded promising results on single-center and single-imaging modality data, these methods usually do not generalize well to unseen distributions or modalities. Even though human experts can identify visual appearances, DL methods often fail to do so when data samples do not follow a similar distribution. Current literature addressing domain gaps in modality changes has focused primarily on natural scene data. However, these methods cannot be directly applied to endoscopic data, as visual cues in such data are more limited compared to natural scenes. In this work, we exploit both style and content information in images by performing instance normalization and feature covariance mapping techniques to preserve robust and generalizable feature representations. Additionally, to avoid the risk of removing salient feature representations associated with objects of interest, we introduce a restitution module within the feature-learning ResNet backbone that retains useful task-relevant features. Our proposed method shows a 13.7% improvement over the baseline DeepLabv3+ and nearly an 8% improvement over recent state-of-the-art (SOTA) methods for the target (different modality) set of the EndoUDA polyp dataset. Similarly, our method achieved a 19% improvement over the baseline and 6% over the best-performing SOTA method on the EndoUDA Barrett's esophagus (BE) dataset.

View on arXiv PDF

Similar