CVIVMar 23, 2023

Medical diffusion on a budget: Textual Inversion for medical image generation

arXiv:2303.13430v228 citationsh-index: 48
Originality Incremental advance
AI Analysis

This work addresses the problem of data scarcity and privacy in medical imaging for researchers and clinicians, though it is incremental as it builds on existing methods.

The study tackled the challenge of generating medical images with limited datasets by adapting pre-trained Stable Diffusion models using Textual Inversion, achieving diagnostically accurate images and increasing prostate cancer detection AUC from 0.78 to 0.80.

Diffusion models for text-to-image generation, known for their efficiency, accessibility, and quality, have gained popularity. While inference with these systems on consumer-grade GPUs is increasingly feasible, training from scratch requires large captioned datasets and significant computational resources. In medical image generation, the limited availability of large, publicly accessible datasets with text reports poses challenges due to legal and ethical concerns. This work shows that adapting pre-trained Stable Diffusion models to medical imaging modalities is achievable by training text embeddings using Textual Inversion. In this study, we experimented with small medical datasets (100 samples each from three modalities) and trained within hours to generate diagnostically accurate images, as judged by an expert radiologist. Experiments with Textual Inversion training and inference parameters reveal the necessity of larger embeddings and more examples in the medical domain. Classification experiments show an increase in diagnostic accuracy (AUC) for detecting prostate cancer on MRI, from 0.78 to 0.80. Further experiments demonstrate embedding flexibility through disease interpolation, combining pathologies, and inpainting for precise disease appearance control. The trained embeddings are compact (less than 1 MB), enabling easy data sharing with reduced privacy concerns.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes