CLMar 17, 2023

Memotion 3: Dataset on Sentiment and Emotion Analysis of Codemixed Hindi-English Memes

AppleStanford
arXiv:2303.09892v326 citationsh-index: 53Has Code
AI Analysis

This addresses the problem of limited resources for multimodal analysis in non-English contexts, specifically for researchers in natural language processing and social media analysis, but it is incremental as it builds on prior Memotion datasets.

The authors tackled the lack of datasets for analyzing sentiment and emotion in Hindi-English codemixed memes by introducing Memotion 3, a new dataset of 10,000 annotated memes, and provided a baseline for the task.

Memes are the new-age conveyance mechanism for humor on social media sites. Memes often include an image and some text. Memes can be used to promote disinformation or hatred, thus it is crucial to investigate in details. We introduce Memotion 3, a new dataset with 10,000 annotated memes. Unlike other prevalent datasets in the domain, including prior iterations of Memotion, Memotion 3 introduces Hindi-English Codemixed memes while prior works in the area were limited to only the English memes. We describe the Memotion task, the data collection and the dataset creation methodologies. We also provide a baseline for the task. The baseline code and dataset will be made available at https://github.com/Shreyashm16/Memotion-3.0

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes