CY CLJan 20

Language, Caste, and Context: Demographic Disparities in AI-Generated Explanations Across Indian and American STEM Educational Systems

Amogh Gupta, Niharika Patil, Sourojit Ghosh, SnehalKumar, S Gaikwad

arXiv:2601.14506v11.2

Originality Incremental advance

AI Analysis

This work highlights demographic disparities in AI-generated educational content, which could exacerbate inequalities for marginalized students globally, though it is incremental as it extends intersectional bias analysis to new contexts.

The paper investigates how large language models (LLMs) exhibit biases in explanation quality for student profiles with intersecting marginalized identities, such as caste and language in India and race and school type in America, finding that models systematically provide lower-quality, simpler explanations to these profiles across different cultural contexts.

The popularization of AI chatbot usage globally has created opportunities for research into their benefits and drawbacks, especially for students using AI assistants for coursework support. This paper asks: how do LLMs perceive the intellectual capabilities of student profiles from intersecting marginalized identities across different cultural contexts? We conduct one of the first large-scale intersectional analyses on LLM explanation quality for Indian and American undergraduate profiles preparing for engineering entrance examinations. By constructing profiles combining multiple demographic dimensions including caste, medium of instruction, and school boards in India, and race, HBCU attendance, and school type in America, alongside universal factors like income and college tier, we examine how quality varies across these factors. We observe biases providing lower-quality outputs to profiles with marginalized backgrounds in both contexts. LLMs such as Qwen2.5-32B-Instruct and GPT-4o demonstrate granular understandings of context-specific discrimination, systematically providing simpler explanations to Hindi/Regional-medium students in India and HBCU profiles in America, treating these as proxies for lower capability. Even when marginalized profiles attain social mobility by getting accepted into elite institutions, they still receive more simplistic explanations, showing how demographic information is inextricably linked to LLM biases. Different models (Qwen2.5-32B-Instruct, GPT-4o, GPT-4o-mini, GPT-OSS 20B) embed similar biases against historically marginalized populations in both contexts, preventing profiles from switching between AI assistants for better results. Our findings have strong implications for AI incorporation into global engineering education.

View on arXiv PDF

Similar