CVJul 25, 2023

CosSIF: Cosine similarity-based image filtering to overcome low inter-class variation in synthetic medical image datasets

Mominul Islam, Hasib Zunair, Nabeel Mohammed

arXiv:2307.13842v26.822 citationsh-index: 18Has Code

Originality Incremental advance

AI Analysis

This work addresses a specific problem in medical image analysis for researchers and practitioners, offering incremental improvements in dataset quality for GAN-based synthetic image generation.

The paper tackles the challenge of low inter-class variation in synthetic medical image datasets by proposing CosSIF, a cosine similarity-based filtering algorithm, which when applied with methods like FAGT or FBGT, leads to performance gains such as a 1.59% increase in sensitivity and 1.88% in AUC on the ISIC-2016 dataset.

Crafting effective deep learning models for medical image analysis is a complex task, particularly in cases where the medical image dataset lacks significant inter-class variation. This challenge is further aggravated when employing such datasets to generate synthetic images using generative adversarial networks (GANs), as the output of GANs heavily relies on the input data. In this research, we propose a novel filtering algorithm called Cosine Similarity-based Image Filtering (CosSIF). We leverage CosSIF to develop two distinct filtering methods: Filtering Before GAN Training (FBGT) and Filtering After GAN Training (FAGT). FBGT involves the removal of real images that exhibit similarities to images of other classes before utilizing them as the training dataset for a GAN. On the other hand, FAGT focuses on eliminating synthetic images with less discriminative features compared to real images used for training the GAN. Experimental results reveal that employing either the FAGT or FBGT method with modern transformer and convolutional-based networks leads to substantial performance gains in various evaluation metrics. FAGT implementation on the ISIC-2016 dataset surpasses the baseline method in terms of sensitivity by 1.59% and AUC by 1.88%. Furthermore, for the HAM10000 dataset, applying FABT outperforms the baseline approach in terms of recall by 13.75%, and with the sole implementation of FAGT, achieves a maximum accuracy of 94.44%.

View on arXiv PDF Code

Similar