CVAIApr 25, 2025

A BERT-Style Self-Supervised Learning CNN for Disease Identification from Retinal Images

arXiv:2504.18049v11 citationsh-index: 19
Originality Incremental advance
AI Analysis

This work addresses label scarcity in medical imaging for researchers and clinicians, though it is incremental as it adapts existing self-supervised methods to a CNN framework.

The study tackled the challenge of limited labeled data in medical imaging by developing a BERT-style self-supervised learning approach using a lightweight CNN, pre-trained on unlabeled retinal images, which significantly improved performance in downstream disease identification tasks such as Alzheimer's, Parkinson's, and retinal diseases.

In the field of medical imaging, the advent of deep learning, especially the application of convolutional neural networks (CNNs) has revolutionized the analysis and interpretation of medical images. Nevertheless, deep learning methods usually rely on large amounts of labeled data. In medical imaging research, the acquisition of high-quality labels is both expensive and difficult. The introduction of Vision Transformers (ViT) and self-supervised learning provides a pre-training strategy that utilizes abundant unlabeled data, effectively alleviating the label acquisition challenge while broadening the breadth of data utilization. However, ViT's high computational density and substantial demand for computing power, coupled with the lack of localization characteristics of its operations on image patches, limit its efficiency and applicability in many application scenarios. In this study, we employ nn-MobileNet, a lightweight CNN framework, to implement a BERT-style self-supervised learning approach. We pre-train the network on the unlabeled retinal fundus images from the UK Biobank to improve downstream application performance. We validate the results of the pre-trained model on Alzheimer's disease (AD), Parkinson's disease (PD), and various retinal diseases identification. The results show that our approach can significantly improve performance in the downstream tasks. In summary, this study combines the benefits of CNNs with the capabilities of advanced self-supervised learning in handling large-scale unlabeled data, demonstrating the potential of CNNs in the presence of label scarcity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes