CVLGROSep 24, 2020

daVinciNet: Joint Prediction of Motion and Surgical State in Robot-Assisted Surgery

arXiv:2009.11937v126 citations
Originality Incremental advance
AI Analysis

It addresses the need for shared control and supervised autonomy in surgical subtasks, such as suturing or ultrasound scanning, for robot-assisted surgery systems.

This paper tackles the problem of predicting future surgical instrument trajectories and surgical subtask states in robot-assisted surgery, achieving up to 93.85% short-term state prediction accuracy and 1.07mm short-term trajectory prediction error.

This paper presents a technique to concurrently and jointly predict the future trajectories of surgical instruments and the future state(s) of surgical subtasks in robot-assisted surgeries (RAS) using multiple input sources. Such predictions are a necessary first step towards shared control and supervised autonomy of surgical subtasks. Minute-long surgical subtasks, such as suturing or ultrasound scanning, often have distinguishable tool kinematics and visual features, and can be described as a series of fine-grained states with transition schematics. We propose daVinciNet - an end-to-end dual-task model for robot motion and surgical state predictions. daVinciNet performs concurrent end-effector trajectory and surgical state predictions using features extracted from multiple data streams, including robot kinematics, endoscopic vision, and system events. We evaluate our proposed model on an extended Robotic Intra-Operative Ultrasound (RIOUS+) imaging dataset collected on a da Vinci Xi surgical system and the JHU-ISI Gesture and Skill Assessment Working Set (JIGSAWS). Our model achieves up to 93.85% short-term (0.5s) and 82.11% long-term (2s) state prediction accuracy, as well as 1.07mm short-term and 5.62mm long-term trajectory prediction error.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes