CLHCDec 3, 2025

Is Lying Only Sinful in Islam? Exploring Religious Bias in Multilingual Large Language Models Across Major Religions

arXiv:2512.03943v11 citationsh-index: 10
Originality Synthesis-oriented
AI Analysis

This addresses bias issues in AI for users in religious contexts, particularly in multilingual settings, but is incremental as it builds on existing bias detection work.

The paper tackled bias in multilingual large language models regarding religion by introducing the BRAND dataset with over 2,400 entries across four major South Asian religions, finding that models perform better in English than Bengali and consistently show bias toward Islam.

While recent developments in large language models have improved bias detection and classification, sensitive subjects like religion still present challenges because even minor errors can result in severe misunderstandings. In particular, multilingual models often misrepresent religions and have difficulties being accurate in religious contexts. To address this, we introduce BRAND: Bilingual Religious Accountable Norm Dataset, which focuses on the four main religions of South Asia: Buddhism, Christianity, Hinduism, and Islam, containing over 2,400 entries, and we used three different types of prompts in both English and Bengali. Our results indicate that models perform better in English than in Bengali and consistently display bias toward Islam, even when answering religion-neutral questions. These findings highlight persistent bias in multilingual models when similar questions are asked in different languages. We further connect our findings to the broader issues in HCI regarding religion and spirituality.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes