IVCVLGApr 15, 2022

Y-Net: A Spatiospectral Dual-Encoder Networkfor Medical Image Segmentation

arXiv:2204.07613v250 citationsh-index: 94
AI Analysis

This work addresses medical image segmentation for retinal OCT analysis, offering incremental improvements over existing methods like U-Net.

The paper tackled automated segmentation of retinal OCT images by introducing Y-Net, a dual-encoder network combining spectral and spatial domain features, resulting in a 13% improvement in fluid segmentation dice score and a 1.9% average dice score gain compared to U-Net.

Automated segmentation of retinal optical coherence tomography (OCT) images has become an important recent direction in machine learning for medical applications. We hypothesize that the anatomic structure of layers and their high-frequency variation in OCT images make retinal OCT a fitting choice for extracting spectral-domain features and combining them with spatial domain features. In this work, we present $Υ$-Net, an architecture that combines the frequency domain features with the image domain to improve the segmentation performance of OCT images. The results of this work demonstrate that the introduction of two branches, one for spectral and one for spatial domain features, brings a very significant improvement in fluid segmentation performance and allows outperformance as compared to the well-known U-Net model. Our improvement was 13% on the fluid segmentation dice score and 1.9% on the average dice score. Finally, removing selected frequency ranges in the spectral domain demonstrates the impact of these features on the fluid segmentation outperformance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes