IVCVJul 2, 2020

4D Spatio-Temporal Convolutional Networks for Object Position Estimation in OCT Volumes

arXiv:2007.01044v13 citations
AI Analysis

This work addresses marker object tracking in surgery using OCT, with incremental improvements by incorporating temporal information.

The paper tackled the problem of object position estimation in OCT volumes for computer-assisted surgery by extending 3D CNNs to 4D spatio-temporal CNNs, resulting in a 30% lower mean absolute error compared to using single volumes.

Tracking and localizing objects is a central problem in computer-assisted surgery. Optical coherence tomography (OCT) can be employed as an optical tracking system, due to its high spatial and temporal resolution. Recently, 3D convolutional neural networks (CNNs) have shown promising performance for pose estimation of a marker object using single volumetric OCT images. While this approach relied on spatial information only, OCT allows for a temporal stream of OCT image volumes capturing the motion of an object at high volumes rates. In this work, we systematically extend 3D CNNs to 4D spatio-temporal CNNs to evaluate the impact of additional temporal information for marker object tracking. Across various architectures, our results demonstrate that using a stream of OCT volumes and employing 4D spatio-temporal convolutions leads to a 30% lower mean absolute error compared to single volume processing with 3D CNNs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes