CLIRLGDec 30, 2019

AraNet: A Deep Learning Toolkit for Arabic Social Media

arXiv:1912.13072v21002 citations
Originality Synthesis-oriented
AI Analysis

This addresses the need for comprehensive Arabic NLP tools for social media analysis, though it is incremental as it applies existing BERT models to new datasets.

The authors tackled the problem of processing Arabic social media by developing AraNet, a deep learning toolkit that predicts age, dialect, gender, emotion, irony, and sentiment, achieving state-of-the-art performance on some tasks and competitive results on others.

We describe AraNet, a collection of deep learning Arabic social media processing tools. Namely, we exploit an extensive host of publicly available and novel social media datasets to train bidirectional encoders from transformer models (BERT) to predict age, dialect, gender, emotion, irony, and sentiment. AraNet delivers state-of-the-art performance on a number of the cited tasks and competitively on others. In addition, AraNet has the advantage of being exclusively based on a deep learning framework and hence feature engineering free. To the best of our knowledge, AraNet is the first to performs predictions across such a wide range of tasks for Arabic NLP and thus meets a critical needs. We publicly release AraNet to accelerate research and facilitate comparisons across the different tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes