CV LG IV | Jul 21, 2023

LatentAugment: Data Augmentation via Guided Manipulation of GAN's Latent Space

Lorenzo Tronchin, Minh H. Vu, Paolo Soda, Tommy Löfstedt

arXiv:2307.11375v15.04 citations | h-index: 31 | Has Code

Originality: Incremental advance

AI Analysis

The work addresses the need for more effective data augmentation in medical imaging and similar domains, though it is an incremental advance because it builds on existing GAN-based approaches.

The paper tackles the problem of limited diversity in data augmentation by proposing LatentAugment, a method that manipulates GAN latent vectors to enhance synthetic image diversity and fidelity, resulting in improved generalization in MRI-to-CT translation tasks compared to standard and GAN-based methods.

Data Augmentation (DA) is a technique to increase the quantity and diversity of the training data, and thereby alleviate overfitting and improve generalisation. However, standard DA produces synthetic data for augmentation with limited diversity. Generative Adversarial Networks (GANs) may unlock additional information in a dataset by generating synthetic samples having the appearance of real images. However, these models struggle to simultaneously address three key requirements: fidelity and high-quality samples; diversity and mode coverage; and fast sampling. Indeed, GANs generate high-quality samples rapidly, but have poor mode coverage, limiting their adoption in DA applications. We propose LatentAugment, a DA strategy that overcomes the low diversity of GANs, opening them up for use in DA applications. Without external supervision, LatentAugment modifies latent vectors and moves them into latent space regions to maximise the synthetic images' diversity and fidelity. It is also agnostic to the dataset and the downstream task. A wide set of experiments shows that LatentAugment improves the generalisation of a deep model translating from MRI to CT, beating both standard DA and GAN-based sampling. Moreover, still in comparison with GAN-based sampling, LatentAugment synthetic samples show superior mode coverage and diversity. Code is available at: https://github.com/ltronchin/LatentAugment.
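The abstract describes moving latent vectors toward regions that yield more diverse yet faithful images, but it does not state the objective being optimised. The following is only a toy sketch of that general idea, not the authors' method. It assumes a hypothetical linear "generator" `W`, a diversity term equal to the squared distance to the nearest real sample, and a penalty `lam` on drifting from the starting latent as a crude stand-in for fidelity. All names and values are illustrative; the repository linked above holds the actual implementation.

```python
# Toy sketch (assumption): gradient ascent on a latent vector z to push the
# generated sample away from its nearest real sample, while a penalty keeps z
# close to its starting point z0. The linear generator W is hypothetical.

W = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # maps a 2-D latent to a 3-D "image"


def generate(z):
    """Apply the toy linear generator: image = W z."""
    return [sum(w * zi for w, zi in zip(row, z)) for row in W]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_real(image, reals):
    """Return the real sample closest to the generated image."""
    return min(reals, key=lambda x: sq_dist(image, x))


def augment_latent(z0, reals, steps=50, lr=0.1, lam=2.0):
    """Maximise ||W z - nearest_real||^2 - lam * ||z - z0||^2 by gradient ascent."""
    z = list(z0)
    for _ in range(steps):
        image = generate(z)
        target = nearest_real(image, reals)
        residual = [g - t for g, t in zip(image, target)]
        # Gradient of the diversity term with respect to z: 2 * W^T * residual.
        grad_div = [
            2.0 * sum(W[i][j] * residual[i] for i in range(len(W)))
            for j in range(len(z))
        ]
        # Gradient of the proximity penalty: 2 * (z - z0).
        grad_reg = [2.0 * (zj - z0j) for zj, z0j in zip(z, z0)]
        z = [zj + lr * (gd - lam * gr) for zj, gd, gr in zip(z, grad_div, grad_reg)]
    return z


reals = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]  # made-up "real" samples
z0 = [0.9, 0.1]                             # made-up starting latent
z_aug = augment_latent(z0, reals)
```

In this toy setting `lam` is chosen larger than the largest eigenvalue of `W^T W`, so the objective is concave and the ascent settles rather than running off to infinity. A real GAN generator is non-linear, so its gradients would come from automatic differentiation instead of the closed form used here.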
