CLAICVApr 13, 2022

TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes

arXiv:2204.06299v1629 citationsh-index: 25
Originality Incremental advance
AI Analysis

This addresses the challenge of identifying hateful content targeting women on social media, which affects online users, but it is incremental as it builds on existing multimodal detection methods.

The paper tackled the problem of detecting and classifying misogynous memes by combining textual and visual features, achieving the best result in the SemEval-2022 Task 5 challenge for classifying misogynous content and its sub-classes.

The detection of offensive, hateful content on social media is a challenging problem that affects many online users on a daily basis. Hateful content is often used to target a group of people based on ethnicity, gender, religion and other factors. The hate or contempt toward women has been increasing on social platforms. Misogynous content detection is especially challenging when textual and visual modalities are combined to form a single context, e.g., an overlay text embedded on top of an image, also known as meme. In this paper, we present a multimodal architecture that combines textual and visual features in order to detect misogynous meme content. The proposed architecture is evaluated in the SemEval-2022 Task 5: MAMI - Multimedia Automatic Misogyny Identification challenge under the team name TIB-VA. Our solution obtained the best result in the Task-B where the challenge is to classify whether a given document is misogynous and further identify the main sub-classes of shaming, stereotype, objectification, and violence.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes