CVApr 4, 2024

Would Deep Generative Models Amplify Bias in Future Models?

arXiv:2404.03242v130 citationsh-index: 21CVPR
Originality Incremental advance
AI Analysis

This addresses concerns about bias propagation in AI systems for computer vision, but the findings are incremental as they show mixed effects rather than a clear amplification.

The paper investigates whether using AI-generated images as training data for computer vision models amplifies social biases, finding that introducing generated images from Stable Diffusion into COCO and CC3M datasets does not uniformly increase bias and can sometimes mitigate it in specific tasks.

We investigate the impact of deep generative models on potential social biases in upcoming computer vision models. As the internet witnesses an increasing influx of AI-generated images, concerns arise regarding inherent biases that may accompany them, potentially leading to the dissemination of harmful content. This paper explores whether a detrimental feedback loop, resulting in bias amplification, would occur if generated images were used as the training data for future models. We conduct simulations by progressively substituting original images in COCO and CC3M datasets with images generated through Stable Diffusion. The modified datasets are used to train OpenCLIP and image captioning models, which we evaluate in terms of quality and bias. Contrary to expectations, our findings indicate that introducing generated images during training does not uniformly amplify bias. Instead, instances of bias mitigation across specific tasks are observed. We further explore the factors that may influence these phenomena, such as artifacts in image generation (e.g., blurry faces) or pre-existing biases in the original datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes