CLOct 14, 2024

Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts

arXiv:2410.11084v128 citationsh-index: 21EMNLP
Originality Incremental advance
AI Analysis

This addresses gender equity issues in AI decision-making for relationship scenarios, highlighting incremental insights into model biases.

The study investigated gender bias in large language models' decision-making regarding relationship conflicts, finding that models consistently favored women, then gender-neutral names, and lastly men, with safety guardrails reducing bias.

Large language models (LLMs) acquire beliefs about gender from training data and can therefore generate text with stereotypical gender attitudes. Prior studies have demonstrated model generations favor one gender or exhibit stereotypes about gender, but have not investigated the complex dynamics that can influence model reasoning and decision-making involving gender. We study gender equity within LLMs through a decision-making lens with a new dataset, DeMET Prompts, containing scenarios related to intimate, romantic relationships. We explore nine relationship configurations through name pairs across three name lists (men, women, neutral). We investigate equity in the context of gender roles through numerous lenses: typical and gender-neutral names, with and without model safety enhancements, same and mixed-gender relationships, and egalitarian versus traditional scenarios across various topics. While all models exhibit the same biases (women favored, then those with gender-neutral names, and lastly men), safety guardrails reduce bias. In addition, models tend to circumvent traditional male dominance stereotypes and side with 'traditionally female' individuals more often, suggesting relationships are viewed as a female domain by the models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes