CL · May 21

Audience Engagement with Arabic Women's Social Empowerment and Wellbeing: A Decadal Corpus

arXiv:2605.22204 · 83.21 citations
Predicted impact: top 63% in CL · last 90 days
Originality: Synthesis-oriented
AI Analysis

For researchers in Arabic NLP and computational social science, this corpus provides a unique resource for studying women's empowerment discourse across Arabic dialects, though it is primarily a dataset contribution.

The paper introduces the Arabic Women and Society Corpus, a decade-long collection of 252,487 public Arabic Facebook posts on women's empowerment, with over 267 million user interactions, enabling large-scale analysis of gender discourse and emotional engagement.

This paper presents the Arabic Women and Society Corpus, a ten-year collection of 252,487 public Arabic Facebook posts related to women's empowerment and social wellbeing. The corpus was collected from 51,660 pages across 77 countries between 2013 and 2024 and records more than 267 million user interactions. Each post includes engagement metrics such as shares, comments, and emotional reactions, providing a unique view of audience sentiment and social attention. The data were processed with an automated pipeline comprising language identification, normalization, and metadata cleaning to ensure reliability and reproducibility. The corpus enables large-scale analysis of gender discourse, social reform, and emotional engagement across Arabic dialects. It supports research in Arabic natural language processing, computational social science, and digital communication studies. The dataset and accompanying documentation will be released upon request for research use.
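The abstract names language identification, normalization, and engagement metrics but does not specify how any of them are implemented. The Python sketch below is only an illustration of what such steps might look like, not the authors' pipeline. The normalization rules (stripping diacritics and tatweel, unifying alef variants), the script-ratio heuristic standing in for language identification, and the field names `shares`, `comments`, and `reactions` are all assumptions made for this example.

```python
import re

# Assumed normalization rules; the paper does not list its exact steps.
DIACRITICS = re.compile(r"[\u064B-\u0652]")          # tashkeel marks (fathatan to sukun)
TATWEEL = "\u0640"                                    # elongation character
ALEF_VARIANTS = re.compile(r"[\u0622\u0623\u0625]")  # alef with madda / hamza above / hamza below
ARABIC_LETTER = re.compile(r"[\u0621-\u064A]")       # core Arabic letter range


def normalize_arabic(text: str) -> str:
    """Strip diacritics and tatweel, unify alef forms, collapse whitespace."""
    text = DIACRITICS.sub("", text)
    text = text.replace(TATWEEL, "")
    text = ALEF_VARIANTS.sub("\u0627", text)
    return " ".join(text.split())


def arabic_ratio(text: str) -> float:
    """Share of alphabetic characters that are Arabic letters.

    A crude stand-in for language identification (hypothetical heuristic).
    """
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    arabic = sum(1 for ch in letters if ARABIC_LETTER.match(ch))
    return arabic / len(letters)


def total_engagement(post: dict) -> int:
    """Sum interaction counts; missing or null fields count as zero.

    The keys used here are hypothetical field names.
    """
    return sum(int(post.get(key) or 0) for key in ("shares", "comments", "reactions"))
```

A post could then be kept only if `arabic_ratio` exceeds a chosen threshold, with its text passed through `normalize_arabic` before analysis. A production pipeline would more likely rely on a trained language identifier, since a script-ratio check cannot separate Arabic from other languages written in Arabic script.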

