IV CVSep 19, 2024

I2I-Galip: Unsupervised Medical Image Translation Using Generative Adversarial CLIP

arXiv:2409.12399v18.52 citationsh-index: 8Has Code

Originality Incremental advance

AI Analysis

This addresses the problem of inefficient multi-domain translation for medical imaging researchers, though it is incremental as it builds on existing GAN and CLIP methods.

The paper tackles unpaired medical image translation by proposing I2I-Galip, which uses a pre-trained CLIP model to enable efficient multi-domain translation with a single lightweight generator, achieving superior performance on MRI and CT datasets.

Unpaired image-to-image translation is a challenging task due to the absence of paired examples, which complicates learning the complex mappings between the distinct distributions of the source and target domains. One of the most commonly used approach for this task is CycleGAN which requires the training of a new pair of generator-discriminator networks for each domain pair. In this paper, we propose a new image-to-image translation framework named Image-to-Image-Generative-Adversarial-CLIP (I2I-Galip) where we utilize a pre-trained multi-model foundation model (i.e., CLIP) to mitigate the need of separate generator-discriminator pairs for each source-target mapping while achieving better and more efficient multi-domain translation. By utilizing the massive knowledge gathered during pre-training a foundation model, our approach makes use of a single lightweight generator network with ~13M parameters for the multi-domain image translation task. Comprehensive experiments on translation performance in public MRI and CT datasets show the superior performance of the proposed framework over the existing approaches. Code will be available (https://github.com/yilmazkorkmaz1/I2I-Galip).

View on arXiv PDF Code

Similar