CVCYApr 8, 2025

Text-to-Image Models and Their Representation of People from Different Nationalities Engaging in Activities

arXiv:2504.06313v3h-index: 5
Originality Synthesis-oriented
AI Analysis

This study addresses biases in AI-generated imagery for diverse global populations, highlighting significant representation issues that could impact fairness and inclusivity in applications.

The paper investigated how a text-to-image model represents people from 208 nationalities in generated images of typical activities, finding that 52.88% and 27.4% of images depicted individuals in traditional attire across two scenarios, with disproportionate effects on regions like the Middle East & North Africa and Sub-Saharan Africa.

This paper investigates how a popular Text-to-Image (T2I) model represents people from 208 different nationalities when prompted to generate images of individuals engaging in typical activities. Two scenarios were developed, and 644 images were generated based on input prompts that specified nationalities. The results show that in one scenario, 52.88% of images, and in the other, 27.4%, depict individuals wearing traditional attire. A statistically significant relationship was observed between this representation pattern and regions. This indicates that the issue disproportionately affects certain areas, particularly the Middle East & North Africa and Sub-Saharan Africa. A notable association with income groups was also found. CLIP, ALIGN, and GPT-4.1 mini were used to measure alignment scores between generated images and 3320 prompts and captions, with findings indicating statistically significant higher scores for images featuring individuals in traditional attire in one scenario. The study also examined revised prompts, finding that the word "traditional" was added by the model to 88.46% of prompts for one scenario. These findings provide valuable insights into T2I models' representation of individuals across different countries, demonstrating how the examined model prioritizes traditional characteristics despite their impracticality for the given activities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes