CL AI CYJun 22, 2025

Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives

Batool Haider, Atmika Gorti, Aman Chadha, Manas Gaur

arXiv:2506.18116v12 citationsh-index: 4

Originality Incremental advance

AI Analysis

This addresses the risk of LLMs propagating biases that harm marginalized groups in mental healthcare, though it is incremental as it builds on existing bias detection methods.

This work tackles the problem of detecting intersectional biases in LLM responses to mental health questions by introducing a multi-hop question answering framework, which identifies systematic disparities across sentiment, demographics, and conditions and achieves 66-94% bias reductions through debiasing techniques.

Large Language Models (LLMs) in mental healthcare risk propagating biases that reinforce stigma and harm marginalized groups. While previous research identified concerning trends, systematic methods for detecting intersectional biases remain limited. This work introduces a multi-hop question answering (MHQA) framework to explore LLM response biases in mental health discourse. We analyze content from the Interpretable Mental Health Instruction (IMHI) dataset across symptom presentation, coping mechanisms, and treatment approaches. Using systematic tagging across age, race, gender, and socioeconomic status, we investigate bias patterns at demographic intersections. We evaluate four LLMs: Claude 3.5 Sonnet, Jamba 1.6, Gemma 3, and Llama 4, revealing systematic disparities across sentiment, demographics, and mental health conditions. Our MHQA approach demonstrates superior detection compared to conventional methods, identifying amplification points where biases magnify through sequential reasoning. We implement two debiasing techniques: Roleplay Simulation and Explicit Bias Reduction, achieving 66-94% bias reductions through few-shot prompting with BBQ dataset examples. These findings highlight critical areas where LLMs reproduce mental healthcare biases, providing actionable insights for equitable AI development.

View on arXiv PDF

Similar