CV AI · Sep 2, 2022

Artifact-Tolerant Clustering-Guided Contrastive Embedding Learning for Ophthalmic Images

Min Shi, Anagha Lokhande, Mojtaba S. Fazli, Vishal Sharma, Yu Tian, Yan Luo, Louis R. Pasquale, Tobias Elze, Michael V. Boland, Nazlee Zebardast, David S. Friedman, Lucy Q. Shen

Harvard, Stanford

arXiv:2209.00773v11.42 citations · h-index: 33 · Has Code

Originality: Incremental advance

AI Analysis

This work addresses the problem of improving computer-aided diagnosis for eye diseases like glaucoma, but it appears incremental as it builds on existing contrastive learning and artifact correction techniques.

The paper addresses learning meaningful features from ophthalmic images for disease diagnosis, a task made difficult by anatomical variation between patients and by image artifacts. It proposes an unsupervised framework, EyeLearn, whose learned representations are evaluated on visual field prediction and glaucoma detection; the authors report that experiments and comparisons with state-of-the-art methods verify its effectiveness.

Ophthalmic images and derivatives such as the retinal nerve fiber layer (RNFL) thickness map are crucial for detecting and monitoring ophthalmic diseases (e.g., glaucoma). For computer-aided diagnosis of eye diseases, the key technique is to automatically extract meaningful features from ophthalmic images that can reveal the biomarkers (e.g., RNFL thinning patterns) linked to functional vision loss. However, representation learning from ophthalmic images that links structural retinal damage with human vision loss is non-trivial mostly due to large anatomical variations between patients. The task becomes even more challenging in the presence of image artifacts, which are common due to issues with image acquisition and automated segmentation. In this paper, we propose an artifact-tolerant unsupervised learning framework termed EyeLearn for learning representations of ophthalmic images. EyeLearn has an artifact correction module to learn representations that can best predict artifact-free ophthalmic images. In addition, EyeLearn adopts a clustering-guided contrastive learning strategy to explicitly capture the intra- and inter-image affinities. During training, images are dynamically organized in clusters to form contrastive samples in which images in the same or different clusters are encouraged to learn similar or dissimilar representations, respectively. To evaluate EyeLearn, we use the learned representations for visual field prediction and glaucoma detection using a real-world ophthalmic image dataset of glaucoma patients. Extensive experiments and comparisons with state-of-the-art methods verified the effectiveness of EyeLearn for learning optimal feature representations from ophthalmic images.
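The clustering-guided contrastive strategy described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact loss, so this example substitutes a generic supervised-contrastive-style loss in which cluster assignments act as pseudo-labels; it is not EyeLearn's actual objective. The function name `cluster_contrastive_loss`, the `temperature` parameter, and the assumption that cluster IDs are already available (for example, from a clustering step re-run during training) are all hypothetical.

```python
import math


def cluster_contrastive_loss(embeddings, cluster_ids, temperature=0.5):
    """Illustrative loss: images in the same cluster are treated as positives.

    embeddings  : list of equal-length float vectors, one per image
    cluster_ids : list of ints, an assumed cluster assignment per image
    temperature : assumed scaling factor for the similarities
    """
    # L2-normalise each vector so the dot product equals cosine similarity.
    normed = []
    for vec in embeddings:
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        normed.append([x / norm for x in vec])

    n = len(normed)
    total, anchors = 0.0, 0
    for i in range(n):
        positives = [j for j in range(n)
                     if j != i and cluster_ids[j] == cluster_ids[i]]
        if not positives:
            continue  # an image alone in its cluster contributes no term

        # Temperature-scaled similarity between the anchor and every other image.
        logits = {
            j: sum(a * b for a, b in zip(normed[i], normed[j])) / temperature
            for j in range(n) if j != i
        }

        # Numerically stable log-sum-exp over all other images.
        peak = max(logits.values())
        log_denom = peak + math.log(sum(math.exp(v - peak) for v in logits.values()))

        # Average negative log-probability of picking a same-cluster image.
        total += -sum(logits[j] - log_denom for j in positives) / len(positives)
        anchors += 1

    return total / anchors if anchors else 0.0


# Toy data: two tight groups of 2-D embeddings.
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
loss_matching = cluster_contrastive_loss(toy_embeddings, [0, 0, 1, 1])
loss_mismatched = cluster_contrastive_loss(toy_embeddings, [0, 1, 0, 1])
```

In this toy setup the loss is lower when the cluster assignment matches the geometry of the embeddings than when it cuts across it, which is the pressure that pulls same-cluster representations together and pushes different-cluster representations apart.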
