CVJul 20, 2023

Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data

arXiv:2307.10698v217 citationsh-index: 24
Originality Highly original
AI Analysis

This addresses the challenge of data scarcity in medical imaging for disease monitoring, offering a novel training approach that could benefit researchers in this domain.

The paper tackles the problem of training large transformer models for retinal image matching with limited annotated data by proposing reverse knowledge distillation, where a small CNN model trains a larger vision transformer, improving generalization and achieving better results on a public dataset.

Retinal image matching plays a crucial role in monitoring disease progression and treatment response. However, datasets with matched keypoints between temporally separated pairs of images are not available in abundance to train transformer-based model. We propose a novel approach based on reverse knowledge distillation to train large models with limited data while preventing overfitting. Firstly, we propose architectural modifications to a CNN-based semi-supervised method called SuperRetina that help us improve its results on a publicly available dataset. Then, we train a computationally heavier model based on a vision transformer encoder using the lighter CNN-based model, which is counter-intuitive in the field knowledge-distillation research where training lighter models based on heavier ones is the norm. Surprisingly, such reverse knowledge distillation improves generalization even further. Our experiments suggest that high-dimensional fitting in representation space may prevent overfitting unlike training directly to match the final output. We also provide a public dataset with annotations for retinal image keypoint detection and matching to help the research community develop algorithms for retinal image applications.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes