CVLGROJul 17, 2020

Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery

arXiv:2007.09107v266 citations
AI Analysis

This work addresses the challenge of robust tool segmentation for surgical scene understanding and robotic automation, but it is incremental as it builds on existing deep learning methods with a new dataset and multi-modal approach.

The paper tackles the problem of semantic tool segmentation in robotic surgery videos by proposing a deep learning model that processes both laparoscopic and simulation images, and introduces a new dataset using the da Vinci Research Kit to address the lack of labeled data.

Semantic tool segmentation in surgical videos is important for surgical scene understanding and computer-assisted interventions as well as for the development of robotic automation. The problem is challenging because different illumination conditions, bleeding, smoke and occlusions can reduce algorithm robustness. At present labelled data for training deep learning models is still lacking for semantic surgical instrument segmentation and in this paper we show that it may be possible to use robot kinematic data coupled with laparoscopic images to alleviate the labelling problem. We propose a new deep learning based model for parallel processing of both laparoscopic and simulation images for robust segmentation of surgical tools. Due to the lack of laparoscopic frames annotated with both segmentation ground truth and kinematic information a new custom dataset was generated using the da Vinci Research Kit (dVRK) and is made available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes