CVLGNov 5, 2017

Label-driven weakly-supervised learning for multimodal deformable image registration

arXiv:1711.01666v2162 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of fast and robust multimodal image registration for intraoperative applications in medical imaging, representing a novel approach but with incremental improvements in methodology.

The paper tackles the problem of aligning medical images from different modalities by proposing a weakly-supervised, label-driven method that learns 3D voxel correspondence from anatomical labels, achieving a median target registration error of 4.2 mm and a median Dice score of 0.88 on prostate cancer patient data.

Spatially aligning medical images from different modalities remains a challenging task, especially for intraoperative applications that require fast and robust algorithms. We propose a weakly-supervised, label-driven formulation for learning 3D voxel correspondence from higher-level label correspondence, thereby bypassing classical intensity-based image similarity measures. During training, a convolutional neural network is optimised by outputting a dense displacement field (DDF) that warps a set of available anatomical labels from the moving image to match their corresponding counterparts in the fixed image. These label pairs, including solid organs, ducts, vessels, point landmarks and other ad hoc structures, are only required at training time and can be spatially aligned by minimising a cross-entropy function of the warped moving label and the fixed label. During inference, the trained network takes a new image pair to predict an optimal DDF, resulting in a fully-automatic, label-free, real-time and deformable registration. For interventional applications where large global transformation prevails, we also propose a neural network architecture to jointly optimise the global- and local displacements. Experiment results are presented based on cross-validating registrations of 111 pairs of T2-weighted magnetic resonance images and 3D transrectal ultrasound images from prostate cancer patients with a total of over 4000 anatomical labels, yielding a median target registration error of 4.2 mm on landmark centroids and a median Dice of 0.88 on prostate glands.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes