CLFeb 18, 2025

Commonsense Reasoning in Arab Culture

arXiv:2502.12788v218 citationsh-index: 6ACL
Originality Incremental advance
AI Analysis

This addresses the problem of Anglocentric biases in AI for Arabic-speaking users by providing a culturally tailored dataset, though it is incremental as it builds on existing dataset creation methods.

The authors tackled the lack of culturally relevant commonsense reasoning datasets for Arabic by introducing ArabCulture, a dataset built from scratch by native speakers across 13 Arab countries, and found that open-weight language models with up to 32B parameters struggle with this task, showing performance variations across regions.

Despite progress in Arabic large language models, such as Jais and AceGPT, their evaluation on commonsense reasoning has largely relied on machine-translated datasets, which lack cultural depth and may introduce Anglocentric biases. Commonsense reasoning is shaped by geographical and cultural contexts, and existing English datasets fail to capture the diversity of the Arab world. To address this, we introduce ArabCulture, a commonsense reasoning dataset in Modern Standard Arabic (MSA), covering cultures of 13 countries across the Gulf, Levant, North Africa, and the Nile Valley. The dataset was built from scratch by engaging native speakers to write and validate culturally relevant questions for their respective countries. ArabCulture spans 12 daily life domains with 54 fine-grained subtopics, reflecting various aspects of social norms, traditions, and everyday experiences. Zero-shot evaluations show that open-weight language models with up to 32B parameters struggle to comprehend diverse Arab cultures, with performance varying across regions. These findings highlight the need for more culturally aware models and datasets tailored to the Arabic-speaking world.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes