IVCVJan 6, 2022

A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration

arXiv:2201.02242v15 citations
Originality Highly original
AI Analysis

This addresses the need for automated multi-modal image registration to support ophthalmologists in diagnosing retinal diseases, representing an incremental improvement with a novel method for a known bottleneck.

The paper tackles the problem of aligning vessel structures in multi-modal retinal images for ophthalmological diagnosis by developing a convolutional neural network for keypoint detection and description, achieving the best registration performance on their and a public dataset compared to competing methods.

Ophthalmological imaging utilizes different imaging systems, such as color fundus, infrared, fluorescein angiography, optical coherence tomography (OCT) or OCT angiography. Multiple images with different modalities or acquisition times are often analyzed for the diagnosis of retinal diseases. Automatically aligning the vessel structures in the images by means of multi-modal registration can support the ophthalmologists in their work. Our method uses a convolutional neural network to extract features of the vessel structure in multi-modal retinal images. We jointly train a keypoint detection and description network on small patches using a classification and a cross-modal descriptor loss function and apply the network to the full image size in the test phase. Our method demonstrates the best registration performance on our and a public multi-modal dataset in comparison to competing methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes