IVCVAug 8, 2022

Inflating 2D Convolution Weights for Efficient Generation of 3D Medical Images

arXiv:2208.03934v316 citationsh-index: 71
Originality Incremental advance
AI Analysis

This work solves the problem of efficient and high-quality 3D medical image generation for applications like brain and heart imaging, though it is incremental as it builds on existing GAN methods with novel architectures and weight inflation.

The paper tackles the problem of generating 3D medical images by addressing data scarcity and high parameter counts in 3D convolutions, achieving a 14.7 improvement in Fréchet inception distance and using only 48.5% of the parameters compared to a baseline.

The generation of three-dimensional (3D) medical images has great application potential since it takes into account the 3D anatomical structure. Two problems prevent effective training of a 3D medical generative model: (1) 3D medical images are expensive to acquire and annotate, resulting in an insufficient number of training images, and (2) a large number of parameters are involved in 3D convolution. Methods: We propose a novel GAN model called 3D Split&Shuffle-GAN. To address the 3D data scarcity issue, we first pre-train a two-dimensional (2D) GAN model using abundant image slices and inflate the 2D convolution weights to improve the initialization of the 3D GAN. Novel 3D network architectures are proposed for both the generator and discriminator of the GAN model to significantly reduce the number of parameters while maintaining the quality of image generation. Several weight inflation strategies and parameter-efficient 3D architectures are investigated. Results: Experiments on both heart (Stanford AIMI Coronary Calcium) and brain (Alzheimer's Disease Neuroimaging Initiative) datasets show that our method leads to improved 3D image generation quality (14.7 improvements on Fréchet inception distance) with significantly fewer parameters (only 48.5% of the baseline method). Conclusions: We built a parameter-efficient 3D medical image generation model. Due to the efficiency and effectiveness, it has the potential to generate high-quality 3D brain and heart images for real use cases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes