IV CV LG | Jan 21, 2023

Pre-text Representation Transfer for Deep Learning with Limited Imbalanced Data: Application to CT-based COVID-19 Detection

Fouzia Altaf, Syed M. S. Islam, Naeem K. Janjua, Naveed Akhtar

arXiv:2301.08888v13.01 citations | h-index: 32

Originality: Incremental advance

AI Analysis

This paper addresses the challenge of training deep learning models on scarce and imbalanced medical data, particularly for COVID-19 detection. The advance is incremental because it builds on existing transfer learning methods.

The paper tackles limited and imbalanced medical image data for disease detection by proposing Pre-text Representation Transfer (PRT), which updates a pre-trained model's representation layers through an unsupervised pre-text task on unannotated medical images. It reports consistent gains over conventional transfer learning in CT-based COVID-19 detection across five class-imbalance ratios.

Annotating medical images for disease detection is often tedious and expensive. Moreover, the available training samples for a given task are generally scarce and imbalanced. These conditions are not conducive to learning effective deep neural models. Hence, it is common to 'transfer' neural networks trained on natural images to the medical image domain. However, this paradigm underperforms because of the large domain gap between natural and medical image data. To address this, we propose a novel concept of Pre-text Representation Transfer (PRT). In contrast to conventional transfer learning, which fine-tunes a source model after replacing its classification layers, PRT retains the original classification layers and updates the representation layers through an unsupervised pre-text task. The task is performed with (original, not synthetic) medical images, without using any annotations. This enables representation transfer with a large amount of training data. This high-fidelity representation transfer allows us to use the resulting model as a more effective feature extractor. Moreover, we can subsequently perform traditional transfer learning with this model. We devise a collaborative representation based classification layer for the case in which we use the model as a feature extractor. We fuse the output of this layer with the predictions of a model obtained by performing traditional transfer learning on our pre-text transferred model. The utility of our technique for the limited and imbalanced data classification problem is demonstrated with an extensive five-fold evaluation of three large-scale models, tested at five different class-imbalance ratios for CT-based COVID-19 detection. Our results show a consistent gain for the proposed method over conventional transfer learning.
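The abstract mentions a collaborative representation based classification layer whose output is fused with a fine-tuned network's predictions, but it does not give the formulas. The sketch below is an illustration only, under stated assumptions: it uses the standard ridge-regularised collaborative representation classifier (code the query over all training features jointly, then score each class by its normalised reconstruction residual) and a simple weighted average for fusion. The function names `crc_probs` and `fuse`, the parameters `lam` and `weight`, and the toy features are hypothetical and are not taken from the paper.

```python
import numpy as np

def crc_probs(train_feats, train_labels, query, lam=0.1):
    """Standard collaborative-representation classifier (assumed variant).

    train_feats: (n_samples, dim) features from a frozen feature extractor.
    train_labels: (n_samples,) integer class labels.
    query: (dim,) feature vector to classify.
    Returns (classes, probabilities), one probability per class.
    """
    train_labels = np.asarray(train_labels)
    X = np.asarray(train_feats, dtype=float).T   # (dim, n): columns are samples
    q = np.asarray(query, dtype=float)
    n = X.shape[1]
    # Ridge-regularised coding of the query over ALL training samples jointly.
    alpha = np.linalg.solve(X.T @ X + lam * np.eye(n), X.T @ q)

    classes = np.unique(train_labels)
    residuals = np.empty(len(classes))
    for i, c in enumerate(classes):
        mask = train_labels == c
        recon = X[:, mask] @ alpha[mask]         # reconstruction from class c only
        residuals[i] = np.linalg.norm(q - recon) / (np.linalg.norm(alpha[mask]) + 1e-12)

    # Assumption: map residuals to probabilities with a softmax over negatives.
    scores = np.exp(-(residuals - residuals.min()))
    return classes, scores / scores.sum()

def fuse(p_crc, p_net, weight=0.5):
    """Assumed fusion rule: convex combination of the two probability vectors."""
    return weight * np.asarray(p_crc) + (1.0 - weight) * np.asarray(p_net)

# Toy, imbalanced example (2 samples of class 0, 1 sample of class 1).
feats = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]])
labels = np.array([0, 0, 1])
query = np.array([1.0, 0.05])

classes, p_crc = crc_probs(feats, labels, query)
p_net = np.array([0.4, 0.6])                     # hypothetical network softmax output
p_fused = fuse(p_crc, p_net)
prediction = classes[np.argmax(p_fused)]
```

In this toy case the query lies close to the class-0 samples, so the collaborative representation scores favour class 0 strongly enough to override the hypothetical network output. The paper's actual layer and fusion rule may differ from both assumptions.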
