CLApr 2, 2021

Humor@IITK at SemEval-2021 Task 7: Large Language Models for Quantifying Humor and Offensiveness

Aishwarya Gupta, Avik Pal, Bholeshwar Khurana, Lakshay Tyagi, Ashutosh Modi

arXiv:2104.00933v131.5711 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of subjective humor and offense detection for applications in recommendation systems and content moderation, but it is incremental as it applies existing large neural models to a new dataset.

This paper tackled the problem of detecting and rating humor and offensiveness in text, which is subjective due to factors like word senses and cultural knowledge, by exploring large neural models and their ensembles. The result was that their models achieved third place in one subtask and consistently ranked around the top 33% in others on the SemEval-2021 Task 7 benchmark.

Humor and Offense are highly subjective due to multiple word senses, cultural knowledge, and pragmatic competence. Hence, accurately detecting humorous and offensive texts has several compelling use cases in Recommendation Systems and Personalized Content Moderation. However, due to the lack of an extensive labeled dataset, most prior works in this domain haven't explored large neural models for subjective humor understanding. This paper explores whether large neural models and their ensembles can capture the intricacies associated with humor/offense detection and rating. Our experiments on the SemEval-2021 Task 7: HaHackathon show that we can develop reasonable humor and offense detection systems with such models. Our models are ranked third in subtask 1b and consistently ranked around the top 33% of the leaderboard for the remaining subtasks.

View on arXiv PDF Code

Similar