CY AI CL HC LGAug 4, 2024

Representation Bias of Adolescents in AI: A Bilingual, Bicultural Study

Robert Wolfe, Aayushi Dangol, Bill Howe, Alexis Hiniker

arXiv:2408.01961v15.114 citationsh-index: 8Has Code

Originality Incremental advance

AI Analysis

This research addresses the problem of AI misrepresenting adolescents, a developmentally vulnerable group, by highlighting biases in language models that could perpetuate harmful stereotypes, with implications for media and AI ethics, though it is incremental as it builds on existing bias studies in AI.

The study examined how AI models, including static word embeddings and generative language models, depict teenagers in the U.S. and Nepal, finding that over 50% of words associated with teenagers in English GloVe and around 30% of outputs from models like GPT2-XL and LLaMA-2-7B focus on societal problems such as violence and drug use, while Nepali models show less bias, and workshops with adolescents revealed a disconnect between AI portrayals and their real-life experiences.

Popular and news media often portray teenagers with sensationalism, as both a risk to society and at risk from society. As AI begins to absorb some of the epistemic functions of traditional media, we study how teenagers in two countries speaking two languages: 1) are depicted by AI, and 2) how they would prefer to be depicted. Specifically, we study the biases about teenagers learned by static word embeddings (SWEs) and generative language models (GLMs), comparing these with the perspectives of adolescents living in the U.S. and Nepal. We find English-language SWEs associate teenagers with societal problems, and more than 50% of the 1,000 words most associated with teenagers in the pretrained GloVe SWE reflect such problems. Given prompts about teenagers, 30% of outputs from GPT2-XL and 29% from LLaMA-2-7B GLMs discuss societal problems, most commonly violence, but also drug use, mental illness, and sexual taboo. Nepali models, while not free of such associations, are less dominated by social problems. Data from workshops with N=13 U.S. adolescents and N=18 Nepalese adolescents show that AI presentations are disconnected from teenage life, which revolves around activities like school and friendship. Participant ratings of how well 20 trait words describe teens are decorrelated from SWE associations, with Pearson's r=.02, n.s. in English FastText and r=.06, n.s. in GloVe; and r=.06, n.s. in Nepali FastText and r=-.23, n.s. in GloVe. U.S. participants suggested AI could fairly present teens by highlighting diversity, while Nepalese participants centered positivity. Participants were optimistic that, if it learned from adolescents, rather than media sources, AI could help mitigate stereotypes. Our work offers an understanding of the ways SWEs and GLMs misrepresent a developmentally vulnerable group and provides a template for less sensationalized characterization.

View on arXiv PDF Code

Similar