CL AIJul 17, 2024

Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for Fine-Grained Scoring of Textual Semantic Relations

Seyedeh Fatemeh Ebrahimi, Karim Akhavan Azari, Amirmasoud Iravani, Hadi Alizadeh, Zeinab Sadat Taghavi, Hossein Sameti

arXiv:2407.12426v114.628 citationsh-index: 17Has Code

Originality Synthesis-oriented

AI Analysis

This addresses semantic textual relatedness scoring for NLP applications, but is incremental as it applies an existing transformer method to a specific competition task.

The paper tackled sentence-level semantic textual relatedness scoring by fine-tuning RoBERTa transformer models across multiple languages, achieving correlations of 0.82 in English (19th rank) and 0.67 in Spanish (15th rank) but only 0.38 in Arabic (20th rank).

Semantic Textual Relatedness holds significant relevance in Natural Language Processing, finding applications across various domains. Traditionally, approaches to STR have relied on knowledge-based and statistical methods. However, with the emergence of Large Language Models, there has been a paradigm shift, ushering in new methodologies. In this paper, we delve into the investigation of sentence-level STR within Track A (Supervised) by leveraging fine-tuning techniques on the RoBERTa transformer. Our study focuses on assessing the efficacy of this approach across different languages. Notably, our findings indicate promising advancements in STR performance, particularly in Latin languages. Specifically, our results demonstrate notable improvements in English, achieving a correlation of 0.82 and securing a commendable 19th rank. Similarly, in Spanish, we achieved a correlation of 0.67, securing the 15th position. However, our approach encounters challenges in languages like Arabic, where we observed a correlation of only 0.38, resulting in a 20th rank.

View on arXiv PDF Code

Similar