CV LGAug 21, 2022

A semi-supervised Teacher-Student framework for surgical tool detection and localization

Mansoor Ali, Gilberto Ochoa-Ruiz, Sharib Ali

arXiv:2208.09926v17.316 citationsh-index: 24Has Code

Originality Incremental advance

AI Analysis

This work addresses the problem of limited annotated data for surgical tool detection, which is incremental as it applies a known SSL approach to a specific domain.

The paper tackles surgical tool detection in minimally invasive surgery by proposing a semi-supervised Teacher-Student framework to address data scarcity and class imbalance, achieving improvements of 8%, 12%, and 27% in mAP over state-of-the-art SSL methods and a fully supervised baseline on 1% labeled data.

Surgical tool detection in minimally invasive surgery is an essential part of computer-assisted interventions. Current approaches are mostly based on supervised methods which require large fully labeled data to train supervised models and suffer from pseudo label bias because of class imbalance issues. However large image datasets with bounding box annotations are often scarcely available. Semi-supervised learning (SSL) has recently emerged as a means for training large models using only a modest amount of annotated data; apart from reducing the annotation cost. SSL has also shown promise to produce models that are more robust and generalizable. Therefore, in this paper we introduce a semi-supervised learning (SSL) framework in surgical tool detection paradigm which aims to mitigate the scarcity of training data and the data imbalance through a knowledge distillation approach. In the proposed work, we train a model with labeled data which initialises the Teacher-Student joint learning, where the Student is trained on Teacher-generated pseudo labels from unlabeled data. We propose a multi-class distance with a margin based classification loss function in the region-of-interest head of the detector to effectively segregate foreground classes from background region. Our results on m2cai16-tool-locations dataset indicate the superiority of our approach on different supervised data settings (1%, 2%, 5%, 10% of annotated data) where our model achieves overall improvements of 8%, 12% and 27% in mAP (on 1% labeled data) over the state-of-the-art SSL methods and a fully supervised baseline, respectively. The code is available at https://github.com/Mansoor-at/Semi-supervised-surgical-tool-det

View on arXiv PDF Code

Similar