LG CL CV CYAug 1, 2023

The Bias Amplification Paradox in Text-to-Image Generation

Preethi Seshadri, Sameer Singh, Yanai Elazar

arXiv:2308.00755v233.987 citationsh-index: 27Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses bias measurement challenges in generative AI for researchers and practitioners, highlighting confounding factors in analyses.

The paper investigates bias amplification in text-to-image generation using Stable Diffusion, finding that gender-occupation biases appear amplified but largely stem from distribution shifts between training captions and prompts, with amplification decreasing drastically after accounting for these differences.

Bias amplification is a phenomenon in which models exacerbate biases or stereotypes present in the training data. In this paper, we study bias amplification in the text-to-image domain using Stable Diffusion by comparing gender ratios in training vs. generated images. We find that the model appears to amplify gender-occupation biases found in the training data (LAION) considerably. However, we discover that amplification can be largely attributed to discrepancies between training captions and model prompts. For example, an inherent difference is that captions from the training data often contain explicit gender information while our prompts do not, which leads to a distribution shift and consequently inflates bias measures. Once we account for distributional differences between texts used for training and generation when evaluating amplification, we observe that amplification decreases drastically. Our findings illustrate the challenges of comparing biases in models and their training data, and highlight confounding factors that impact analyses.

View on arXiv PDF Code

Similar