CYAICLAug 22, 2025

Should LLMs be WEIRD? Exploring WEIRDness and Human Rights in Large Language Models

arXiv:2508.19269v17 citationsh-index: 17Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society
Originality Incremental advance
AI Analysis

This highlights a tension between cultural diversity and human rights in LLMs, which is crucial for developers and users concerned with fairness and bias.

The study evaluated five LLMs for alignment with WEIRD values and human rights, finding that models with lower WEIRD alignment, like BLOOM and Qwen, produced more culturally varied responses but were 2% to 4% more likely to violate human rights, particularly on gender and equality issues.

Large language models (LLMs) are often trained on data that reflect WEIRD values: Western, Educated, Industrialized, Rich, and Democratic. This raises concerns about cultural bias and fairness. Using responses to the World Values Survey, we evaluated five widely used LLMs: GPT-3.5, GPT-4, Llama-3, BLOOM, and Qwen. We measured how closely these responses aligned with the values of the WEIRD countries and whether they conflicted with human rights principles. To reflect global diversity, we compared the results with the Universal Declaration of Human Rights and three regional charters from Asia, the Middle East, and Africa. Models with lower alignment to WEIRD values, such as BLOOM and Qwen, produced more culturally varied responses but were 2% to 4% more likely to generate outputs that violated human rights, especially regarding gender and equality. For example, some models agreed with the statements ``a man who cannot father children is not a real man'' and ``a husband should always know where his wife is'', reflecting harmful gender norms. These findings suggest that as cultural representation in LLMs increases, so does the risk of reproducing discriminatory beliefs. Approaches such as Constitutional AI, which could embed human rights principles into model behavior, may only partly help resolve this tension.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes