CVDec 18, 2025

Memory-Enhanced SAM3 for Occlusion-Robust Surgical Instrument Segmentation

arXiv:2512.16880v1h-index: 5
Originality Incremental advance
AI Analysis

This work addresses occlusion-robust segmentation for surgical interventions, offering a training-free solution with significant performance gains, though it is incremental as it builds upon the existing SAM3 framework.

The paper tackled the problem of surgical instrument segmentation in endoscopic videos, which is challenging due to occlusions and other artifacts, by proposing ReMeDI-SAM3, a training-free memory-enhanced extension of SAM3 that achieved absolute mcIoU improvements of around 7% on EndoVis17 and 16% on EndoVis18 over vanilla SAM3.

Accurate surgical instrument segmentation in endoscopic videos is crucial for computer-assisted interventions, yet remains challenging due to frequent occlusions, rapid motion, specular artefacts, and long-term instrument re-entry. While SAM3 provides a powerful spatio-temporal framework for video object segmentation, its performance in surgical scenes is limited by indiscriminate memory updates, fixed memory capacity, and weak identity recovery after occlusions. We propose ReMeDI-SAM3, a training-free memory-enhanced extension of SAM3, that addresses these limitations through three components: (i) relevance-aware memory filtering with a dedicated occlusion-aware memory for storing pre-occlusion frames, (ii) a piecewise interpolation scheme that expands the effective memory capacity, and (iii) a feature-based re-identification module with temporal voting for reliable post-occlusion identity disambiguation. Together, these components mitigate error accumulation and enable reliable recovery after occlusions. Evaluations on EndoVis17 and EndoVis18 under a zero-shot setting show absolute mcIoU improvements of around 7% and 16%, respectively, over vanilla SAM3, outperforming even prior training-based approaches. Project page: https://valaybundele.github.io/remedi-sam3/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes