CLMar 16, 2023

SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification

Ben Wu, Olesya Razuvayevskaya, Freddy Heppell, João A. Leite, Carolina Scarton, Kalina Bontcheva, Xingyi Song

arXiv:2303.09421v226.5226 citationsh-index: 20Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of analyzing online news content for misinformation and bias across languages, though it is incremental as it builds on existing models and competition tasks.

The paper tackled the problem of classifying news genre, framing, and persuasion techniques in online news across multiple languages, achieving top rankings in SemEval-2023 Task 3, including joint-first for German in genre classification and first place in 3 languages for framing.

This paper describes our approach for SemEval-2023 Task 3: Detecting the category, the framing, and the persuasion techniques in online news in a multi-lingual setup. For Subtask 1 (News Genre), we propose an ensemble of fully trained and adapter mBERT models which was ranked joint-first for German, and had the highest mean rank of multi-language teams. For Subtask 2 (Framing), we achieved first place in 3 languages, and the best average rank across all the languages, by using two separate ensembles: a monolingual RoBERTa-MUPPETLARGE and an ensemble of XLM-RoBERTaLARGE with adapters and task adaptive pretraining. For Subtask 3 (Persuasion Techniques), we train a monolingual RoBERTa-Base model for English and a multilingual mBERT model for the remaining languages, which achieved top 10 for all languages, including 2nd for English. For each subtask, we compared monolingual and multilingual approaches, and considered class imbalance techniques.

View on arXiv PDF Code

Similar