CY · AI · Apr 28

Assessing the Geographic Diversity of AI's Platial Representations in Image Generation

arXiv:2606.05188 · 56.21 citations
Predicted impact: top 27% in CY · last 90 days
Originality: Incremental advance
AI Analysis

For GIScience and AI ethics, the paper highlights a novel dimension of bias in multimodal AI outputs, though its findings are incremental.

This paper assesses geographic diversity in AI image generation using GPT and DALL-E models. It finds that older models can show greater geographic diversity despite producing lower-quality images, and that prompt revision yields more diversity than image generation. The analysis also reveals homogeneity across models, which risks producing stereotypical representations of places.

(Gen)AI diversity is not merely an ethical issue. From the perspective of geographic information science (GIScience), it could be interpreted as a function of uncertainty and as a form of cognitive bias, embedded in AI outputs. Recent work has sought to develop information-theoretic diversity measures and apply them to evaluate AI-chatbot outputs in a geographic context. As the AI ecosystem to which we are exposed on a daily basis becomes rapidly multimodal, we believe it is important to examine geographic diversity across various modalities. Focusing on images, this paper aims to fill this research gap. First, we select the GPT and DALL-E models as state-of-the-art examples and point out how assessing their geographic diversity involves various stages, including prompt revision and image generation. Then, taking inspiration from species diversity measures in ecological research, we incorporate similarity weighting into the measurement of geographic diversity. Next, we demonstrate how to evaluate geographic diversity in image generation through a case study. Our analysis reveals several counterintuitive findings. For instance, older models can exhibit greater geographic diversity despite producing lower-quality images, and prompt revision yields greater geographic diversity than image generation. At the same time, we observe explicit model homogeneity underlying the lack of geographic diversity, as the selected models consistently depict the same prototypical geo-specific feature or similar features. This is concerning, as it risks producing stereotypical representations of places.
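The abstract mentions similarity weighting borrowed from species diversity measures but does not give a formula. As a rough illustration only, the sketch below uses the Leinster-Cobbold similarity-sensitive diversity from ecology, which is an assumption here and may differ from the measure the paper actually adopts. The function name, the feature counts, and the similarity values are all hypothetical.

```python
import math


def similarity_weighted_diversity(counts, Z, q=1.0):
    """Leinster-Cobbold diversity of order q (assumed stand-in, not the paper's formula).

    counts: how often each geo-specific feature appears in the outputs.
    Z:      square similarity matrix, Z[i][j] in [0, 1] and Z[i][i] == 1.
    q:      sensitivity to dominant features (q=1 is the Shannon-like case).
    """
    total = sum(counts)
    p = [c / total for c in counts]
    n = len(p)
    # Expected similarity between feature i and a randomly drawn output.
    zp = [sum(Z[i][j] * p[j] for j in range(n)) for i in range(n)]
    if abs(q - 1.0) < 1e-12:
        # Limit q -> 1: exponential of a similarity-aware entropy.
        return math.exp(-sum(pi * math.log(zi) for pi, zi in zip(p, zp) if pi > 0))
    s = sum(pi * zi ** (q - 1.0) for pi, zi in zip(p, zp) if pi > 0)
    return s ** (1.0 / (1.0 - q))


# Hypothetical counts of three features depicted across ten generated images.
counts = [6, 3, 1]
# Identity matrix: every feature is treated as completely distinct.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
# Hypothetical similarities: the first two features are visually close.
similar = [[1.0, 0.8, 0.1], [0.8, 1.0, 0.1], [0.1, 0.1, 1.0]]

naive = similarity_weighted_diversity(counts, identity, q=1.0)
weighted = similarity_weighted_diversity(counts, similar, q=1.0)
print(f"effective features, unweighted: {naive:.2f}")
print(f"effective features, similarity-weighted: {weighted:.2f}")
```

With the identity matrix the measure reduces to an ordinary effective-number (Hill) diversity. Once similar features are discounted, the effective number can only stay the same or drop, which is one way a set of outputs that looks varied can still score as homogeneous.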

