CLApr 13, 2023

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)

arXiv:2304.06845v2240 citationsh-index: 56Has Code
AI Analysis

This addresses the problem of limited resources for sentiment analysis in African languages for NLP researchers, but it is incremental as it applies existing methods to new data.

The paper tackled sentiment analysis for 14 African languages by introducing the AfriSenti-SemEval shared task, with results showing best weighted F1 scores of 71.31 for monolingual classification and 75.06 for multilingual classification.

We present the first Africentric SemEval Shared task, Sentiment Analysis for African Languages (AfriSenti-SemEval) - The dataset is available at https://github.com/afrisenti-semeval/afrisent-semeval-2023. AfriSenti-SemEval is a sentiment classification challenge in 14 African languages: Amharic, Algerian Arabic, Hausa, Igbo, Kinyarwanda, Moroccan Arabic, Mozambican Portuguese, Nigerian Pidgin, Oromo, Swahili, Tigrinya, Twi, Xitsonga, and Yorùbá (Muhammad et al., 2023), using data labeled with 3 sentiment classes. We present three subtasks: (1) Task A: monolingual classification, which received 44 submissions; (2) Task B: multilingual classification, which received 32 submissions; and (3) Task C: zero-shot classification, which received 34 submissions. The best performance for tasks A and B was achieved by NLNDE team with 71.31 and 75.06 weighted F1, respectively. UCAS-IIE-NLP achieved the best average score for task C with 58.15 weighted F1. We describe the various approaches adopted by the top 10 systems and their approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes