CLSep 13, 2023

OYXOY: A Modern NLP Test Suite for Modern Greek

arXiv:2309.07009v2103 citationsh-index: 19
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of under-resourced evaluation for Greek NLP, providing foundational tools for researchers and developers in that domain, though it is incremental in adapting existing methods to a new language.

The paper tackles the lack of evaluation resources for Greek NLP by introducing a test suite with four expert-verified tasks, including natural language inference, word sense disambiguation, and metaphor detection, and demonstrates its challenging nature through baseline experiments with state-of-the-art models.

This paper serves as a foundational step towards the development of a linguistically motivated and technically relevant evaluation suite for Greek NLP. We initiate this endeavor by introducing four expert-verified evaluation tasks, specifically targeted at natural language inference, word sense disambiguation (through example comparison or sense selection) and metaphor detection. More than language-adapted replicas of existing tasks, we contribute two innovations which will resonate with the broader resource and evaluation community. Firstly, our inference dataset is the first of its kind, marking not just \textit{one}, but rather \textit{all} possible inference labels, accounting for possible shifts due to e.g. ambiguity or polysemy. Secondly, we demonstrate a cost-efficient method to obtain datasets for under-resourced languages. Using ChatGPT as a language-neutral parser, we transform the Dictionary of Standard Modern Greek into a structured format, from which we derive the other three tasks through simple projections. Alongside each task, we conduct experiments using currently available state of the art machinery. Our experimental baselines affirm the challenging nature of our tasks and highlight the need for expedited progress in order for the Greek NLP ecosystem to keep pace with contemporary mainstream research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes