CLApr 2, 2021

Humor@IITK at SemEval-2021 Task 7: Large Language Models for Quantifying Humor and Offensiveness

arXiv:2104.00933v1711 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of subjective humor and offense detection for applications in recommendation systems and content moderation, but it is incremental as it applies existing large neural models to a new dataset.

This paper tackled the problem of detecting and rating humor and offensiveness in text, which is subjective due to factors like word senses and cultural knowledge, by exploring large neural models and their ensembles. The result was that their models achieved third place in one subtask and consistently ranked around the top 33% in others on the SemEval-2021 Task 7 benchmark.

Humor and Offense are highly subjective due to multiple word senses, cultural knowledge, and pragmatic competence. Hence, accurately detecting humorous and offensive texts has several compelling use cases in Recommendation Systems and Personalized Content Moderation. However, due to the lack of an extensive labeled dataset, most prior works in this domain haven't explored large neural models for subjective humor understanding. This paper explores whether large neural models and their ensembles can capture the intricacies associated with humor/offense detection and rating. Our experiments on the SemEval-2021 Task 7: HaHackathon show that we can develop reasonable humor and offense detection systems with such models. Our models are ranked third in subtask 1b and consistently ranked around the top 33% of the leaderboard for the remaining subtasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes