IVCVLGOct 23, 2024

Medical Imaging Complexity and its Effects on GAN Performance

CMU
arXiv:2410.17959v14 citationsh-index: 3ACCV Workshops
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of determining optimal training data requirements for GANs in medical imaging, which is incremental as it benchmarks existing methods.

The study investigated how training set size and image complexity affect GAN performance in medical image synthesis, finding that performance generally improves with larger datasets but degrades with higher complexity.

The proliferation of machine learning models in diverse clinical applications has led to a growing need for high-fidelity, medical image training data. Such data is often scarce due to cost constraints and privacy concerns. Alleviating this burden, medical image synthesis via generative adversarial networks (GANs) emerged as a powerful method for synthetically generating photo-realistic images based on existing sets of real medical images. However, the exact image set size required to efficiently train such a GAN is unclear. In this work, we experimentally establish benchmarks that measure the relationship between a sample dataset size and the fidelity of the generated images, given the dataset's distribution of image complexities. We analyze statistical metrics based on delentropy, an image complexity measure rooted in Shannon's entropy in information theory. For our pipeline, we conduct experiments with two state-of-the-art GANs, StyleGAN 3 and SPADE-GAN, trained on multiple medical imaging datasets with variable sample sizes. Across both GANs, general performance improved with increasing training set size but suffered with increasing complexity.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes