Memotion 3: Dataset on Sentiment and Emotion Analysis of Codemixed Hindi-English Memes
This work addresses the limited resources for multimodal analysis in non-English contexts, a gap that affects researchers in natural language processing and social media analysis. It is incremental, however, because it builds on prior Memotion datasets.
The authors tackled the lack of datasets for analyzing sentiment and emotion in Hindi-English codemixed memes by introducing Memotion 3, a new dataset of 10,000 annotated memes, and provided a baseline for the task.
Memes are the new-age conveyance mechanism for humor on social media sites. Memes often include an image and some text. Because memes can be used to promote disinformation or hatred, it is crucial to investigate them in detail. We introduce Memotion 3, a new dataset with 10,000 annotated memes. Unlike other prevalent datasets in the domain, including prior iterations of Memotion, which were limited to English memes, Memotion 3 introduces Hindi-English codemixed memes. We describe the Memotion task, the data collection, and the dataset creation methodologies. We also provide a baseline for the task. The baseline code and dataset will be made available at https://github.com/Shreyashm16/Memotion-3.0