CLLGJun 10, 2023

Towards Arabic Multimodal Dataset for Sentiment Analysis

arXiv:2306.06322v111 citationsh-index: 11
Originality Synthesis-oriented
AI Analysis

This addresses the problem of limited resources for Arabic multimodal sentiment analysis, which is incremental as it builds on existing methods for a specific language.

The authors tackled the lack of standard datasets for Arabic multimodal sentiment analysis by creating a new dataset using transformers and feature extraction tools, and experiments showed promising results despite its small size.

Multimodal Sentiment Analysis (MSA) has recently become a centric research direction for many real-world applications. This proliferation is due to the fact that opinions are central to almost all human activities and are key influencers of our behaviors. In addition, the recent deployment of Deep Learning-based (DL) models has proven their high efficiency for a wide range of Western languages. In contrast, Arabic DL-based multimodal sentiment analysis (MSA) is still in its infantile stage due, mainly, to the lack of standard datasets. In this paper, our investigation is twofold. First, we design a pipeline that helps building our Arabic Multimodal dataset leveraging both state-of-the-art transformers and feature extraction tools within word alignment techniques. Thereafter, we validate our dataset using state-of-the-art transformer-based model dealing with multimodality. Despite the small size of the outcome dataset, experiments show that Arabic multimodality is very promising

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes