IVAICVMar 25, 2025

Wavelet-based Global-Local Interaction Network with Cross-Attention for Multi-View Diabetic Retinopathy Detection

arXiv:2503.19329v1h-index: 19Has CodeICME
Originality Incremental advance
AI Analysis

This work addresses the problem of incomplete lesion detection in diabetic retinopathy for medical imaging applications, representing an incremental improvement over existing multi-view methods.

The paper tackles the challenge of multi-view diabetic retinopathy detection by addressing variable lesion sizes and scattered locations, and introduces a two-branch network with wavelet-based features and cross-view fusion, achieving effective results on large public datasets.

Multi-view diabetic retinopathy (DR) detection has recently emerged as a promising method to address the issue of incomplete lesions faced by single-view DR. However, it is still challenging due to the variable sizes and scattered locations of lesions. Furthermore, existing multi-view DR methods typically merge multiple views without considering the correlations and redundancies of lesion information across them. Therefore, we propose a novel method to overcome the challenges of difficult lesion information learning and inadequate multi-view fusion. Specifically, we introduce a two-branch network to obtain both local lesion features and their global dependencies. The high-frequency component of the wavelet transform is used to exploit lesion edge information, which is then enhanced by global semantic to facilitate difficult lesion learning. Additionally, we present a cross-view fusion module to improve multi-view fusion and reduce redundancy. Experimental results on large public datasets demonstrate the effectiveness of our method. The code is open sourced on https://github.com/HuYongting/WGLIN.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes