CVLGNov 23, 2025

Functional Localization Enforced Deep Anomaly Detection Using Fundus Images

arXiv:2511.18627v1
Originality Incremental advance
AI Analysis

This addresses reliable detection of retinal diseases like diabetic retinopathy and age-related macular degeneration for medical diagnosis, with incremental improvements over existing methods.

The study tackled retinal disease detection from fundus images by evaluating a Vision Transformer classifier across multiple datasets, achieving accuracies of 0.789-0.843 and an AUC of 0.91 on one dataset, and developed a GANomaly-based anomaly detector with an AUC of 0.76.

Reliable detection of retinal diseases from fundus images is challenged by the variability in imaging quality, subtle early-stage manifestations, and domain shift across datasets. In this study, we systematically evaluated a Vision Transformer (ViT) classifier under multiple augmentation and enhancement strategies across several heterogeneous public datasets, as well as the AEyeDB dataset, a high-quality fundus dataset created in-house and made available for the research community. The ViT demonstrated consistently strong performance, with accuracies ranging from 0.789 to 0.843 across datasets and diseases. Diabetic retinopathy and age-related macular degeneration were detected reliably, whereas glaucoma remained the most frequently misclassified disease. Geometric and color augmentations provided the most stable improvements, while histogram equalization benefited datasets dominated by structural subtlety. Laplacian enhancement reduced performance across different settings. On the Papila dataset, the ViT with geometric augmentation achieved an AUC of 0.91, outperforming previously reported convolutional ensemble baselines (AUC of 0.87), underscoring the advantages of transformer architectures and multi-dataset training. To complement the classifier, we developed a GANomaly-based anomaly detector, achieving an AUC of 0.76 while providing inherent reconstruction-based explainability and robust generalization to unseen data. Probabilistic calibration using GUESS enabled threshold-independent decision support for future clinical implementation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes