CLJul 24, 2020

FiSSA at SemEval-2020 Task 9: Fine-tuned For Feelings

arXiv:2007.12544v3991 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for sentiment analysis in multilingual social media contexts, placing tenth in a competition.

The paper tackled sentiment classification on Spanish-English code-mixed social media data by fine-tuning pre-trained Transformer models, achieving a best weighted F1-score of 0.739 on test data with the XLM-RoBERTa model.

In this paper, we present our approach for sentiment classification on Spanish-English code-mixed social media data in the SemEval-2020 Task 9. We investigate performance of various pre-trained Transformer models by using different fine-tuning strategies. We explore both monolingual and multilingual models with the standard fine-tuning method. Additionally, we propose a custom model that we fine-tune in two steps: once with a language modeling objective, and once with a task-specific objective. Although two-step fine-tuning improves sentiment classification performance over the base model, the large multilingual XLM-RoBERTa model achieves best weighted F1-score with 0.537 on development data and 0.739 on test data. With this score, our team jupitter placed tenth overall in the competition.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes