CVNov 22, 2021

Exploring Segment-level Semantics for Online Phase Recognition from Surgical Videos

arXiv:2111.11044v353 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the problem of improving accuracy in surgical phase recognition for robot-assisted surgeries, representing an incremental advancement over existing methods.

The paper tackled surgical phase recognition from videos by proposing a segment-attentive hierarchical consistency network (SAHC) that learns segment-level semantics to refine frame-wise predictions, achieving state-of-the-art results on the M2CAI16 and Cholec80 datasets.

Automatic surgical phase recognition plays a vital role in robot-assisted surgeries. Existing methods ignored a pivotal problem that surgical phases should be classified by learning segment-level semantics instead of solely relying on frame-wise information. This paper presents a segment-attentive hierarchical consistency network (SAHC) for surgical phase recognition from videos. The key idea is to extract hierarchical high-level semantic-consistent segments and use them to refine the erroneous predictions caused by ambiguous frames. To achieve it, we design a temporal hierarchical network to generate hierarchical high-level segments. Then, we introduce a hierarchical segment-frame attention module to capture relations between the low-level frames and high-level segments. By regularizing the predictions of frames and their corresponding segments via a consistency loss, the network can generate semantic-consistent segments and then rectify the misclassified predictions caused by ambiguous low-level frames. We validate SAHC on two public surgical video datasets, i.e., the M2CAI16 challenge dataset and the Cholec80 dataset. Experimental results show that our method outperforms previous state-of-the-arts and ablation studies prove the effectiveness of our proposed modules. Our code has been released at: https://github.com/xmed-lab/SAHC.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes