CLSep 30, 2024

When Stereotypes GTG: The Impact of Predictive Text Suggestions on Gender Bias in Human-AI Co-Writing

arXiv:2409.20390v21 citationsh-index: 12
Originality Incremental advance
AI Analysis

This addresses the problem of AI amplifying social biases in collaborative writing for users, though it shows technical debiasing is only partially effective.

The study investigated how predictive text suggestions from language models influence gender bias in human-AI co-writing, finding that anti-stereotypical suggestions increased anti-stereotypical stories by a significant rate, but pro-stereotypical narratives still dominated.

AI-based systems such as language models have been shown to replicate and even amplify social biases reflected in their training data. Among other questionable behaviors, this can lead to AI-generated text--and text suggestions--that contain normatively inappropriate stereotypical associations. Little is known, however, about how this behavior impacts the writing produced by people using these systems. We address this gap by measuring how much impact stereotypes or anti-stereotypes in English single-word LM predictive text suggestions have on the stories that people write using those tools in a co-writing scenario. We find that ($n=414$), LM suggestions that challenge stereotypes sometimes lead to a significantly increased rate of anti-stereotypical co-written stories. However, despite this increased rate of anti-stereotypical stories, pro-stereotypical narratives still dominated the co-written stories, demonstrating that technical debiasing is only a partially effective strategy to alleviate harms from human-AI collaboration.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes