CVLGJul 22, 2020

Endo-Sim2Real: Consistency learning-based domain adaptation for instrument segmentation

arXiv:2007.11514v139 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of expensive manual annotation for real endoscopic videos in computer-assisted interventions, offering a domain adaptation solution that is incremental in leveraging existing methods.

The paper tackles the problem of surgical tool segmentation in endoscopic videos by proposing a consistency-based domain adaptation framework to bridge the performance gap between simulated and real data, showing improved segmentation quality and quantity compared to state-of-the-art solutions on datasets like Cholec80 and EndoVis'15.

Surgical tool segmentation in endoscopic videos is an important component of computer assisted interventions systems. Recent success of image-based solutions using fully-supervised deep learning approaches can be attributed to the collection of big labeled datasets. However, the annotation of a big dataset of real videos can be prohibitively expensive and time consuming. Computer simulations could alleviate the manual labeling problem, however, models trained on simulated data do not generalize to real data. This work proposes a consistency-based framework for joint learning of simulated and real (unlabeled) endoscopic data to bridge this performance generalization issue. Empirical results on two data sets (15 videos of the Cholec80 and EndoVis'15 dataset) highlight the effectiveness of the proposed \emph{Endo-Sim2Real} method for instrument segmentation. We compare the segmentation of the proposed approach with state-of-the-art solutions and show that our method improves segmentation both in terms of quality and quantity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes