CVFeb 16, 2025

Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review

arXiv:2502.14886v29 citationsh-index: 19
Originality Synthesis-oriented
AI Analysis

It addresses the problem of improving surgical precision and safety for patients and surgeons, but is incremental as it surveys existing advancements.

This paper reviews how foundation AI models and other ML/DL technologies enhance surgical scene understanding in minimally invasive surgery, improving tasks like segmentation and instrument tracking, but notes that more effort is needed for clinical integration.

Recent advancements in machine learning (ML) and deep learning (DL), particularly through the introduction of Foundation Models (FMs), have significantly enhanced surgical scene understanding within minimally invasive surgery (MIS). This paper surveys the integration of state-of-the-art ML and DL technologies, including Convolutional Neural Networks (CNNs), Vision Transformers (ViTs), and Foundation Models like the Segment Anything Model (SAM), into surgical workflows. These technologies improve segmentation accuracy, instrument tracking, and phase recognition in surgical scene understanding. The paper explores the challenges these technologies face, such as data variability and computational demands, and discusses ethical considerations and integration hurdles in clinical settings. Highlighting the roles of FMs, we bridge the technological capabilities with clinical needs and outline future research directions to enhance the adaptability, efficiency, and ethical alignment of AI applications in surgery. Our findings suggest that substantial progress has been made; however, more focused efforts are required to achieve seamless integration of these technologies into clinical workflows, ensuring they complement surgical practice by enhancing precision, reducing risks, and optimizing patient outcomes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes