CLJun 1, 2021

SemEval-2021 Task 1: Lexical Complexity Prediction

Matthew Shardlow, Richard Evans, Gustavo Henrique Paetzold, Marcos Zampieri

arXiv:2106.00473v131.7718 citations

Originality Synthesis-oriented

AI Analysis

This is an incremental task for NLP researchers, focusing on benchmarking methods for lexical complexity prediction.

The paper tackled the problem of predicting lexical complexity for single words and multi-word expressions in English, using an augmented corpus and attracting 198 teams with 54 and 37 submissions for the two sub-tasks.

This paper presents the results and main findings of SemEval-2021 Task 1 - Lexical Complexity Prediction. We provided participants with an augmented version of the CompLex Corpus (Shardlow et al 2020). CompLex is an English multi-domain corpus in which words and multi-word expressions (MWEs) were annotated with respect to their complexity using a five point Likert scale. SemEval-2021 Task 1 featured two Sub-tasks: Sub-task 1 focused on single words and Sub-task 2 focused on MWEs. The competition attracted 198 teams in total, of which 54 teams submitted official runs on the test data to Sub-task 1 and 37 to Sub-task 2.

View on arXiv PDF

Similar