CVCLJun 21, 2023

VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution

CMUOxford
arXiv:2306.12424v366 citationsh-index: 23Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of gender bias in AI systems for researchers and developers, though it is incremental as it builds on existing schemas and focuses on binary gender.

The authors introduced VisoGender, a dataset for evaluating gender bias in vision-language models, focusing on occupation-related biases within binary gender. They benchmarked several state-of-the-art models and found that they demonstrate bias in resolving gender, with captioning models generally less biased than Vision-Language Encoders.

We introduce VisoGender, a novel dataset for benchmarking gender bias in vision-language models. We focus on occupation-related biases within a hegemonic system of binary gender, inspired by Winograd and Winogender schemas, where each image is associated with a caption containing a pronoun relationship of subjects and objects in the scene. VisoGender is balanced by gender representation in professional roles, supporting bias evaluation in two ways: i) resolution bias, where we evaluate the difference between pronoun resolution accuracies for image subjects with gender presentations perceived as masculine versus feminine by human annotators and ii) retrieval bias, where we compare ratios of professionals perceived to have masculine and feminine gender presentations retrieved for a gender-neutral search query. We benchmark several state-of-the-art vision-language models and find that they demonstrate bias in resolving binary gender in complex scenes. While the direction and magnitude of gender bias depends on the task and the model being evaluated, captioning models are generally less biased than Vision-Language Encoders. Dataset and code are available at https://github.com/oxai/visogender

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes