CVJun 8, 2018

Unsupervised Learning for Surgical Motion by Learning to Predict the Future

arXiv:1806.03318v119 citations
AI Analysis

This work addresses the challenge of unsupervised representation learning for surgical motion analysis, which is incremental as it builds on existing methods with specific improvements.

The paper tackles the problem of learning meaningful representations of surgical motion without supervision by predicting future motion, achieving a state-of-the-art F1 score of 0.77 ± 0.05 for information retrieval in suturing activity.

We show that it is possible to learn meaningful representations of surgical motion, without supervision, by learning to predict the future. An architecture that combines an RNN encoder-decoder and mixture density networks (MDNs) is developed to model the conditional distribution over future motion given past motion. We show that the learned encodings naturally cluster according to high-level activities, and we demonstrate the usefulness of these learned encodings in the context of information retrieval, where a database of surgical motion is searched for suturing activity using a motion-based query. Future prediction with MDNs is found to significantly outperform simpler baselines as well as the best previously-published result for this task, advancing state-of-the-art performance from an F1 score of 0.60 +- 0.14 to 0.77 +- 0.05.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes