CLSep 9, 2018

SHOMA at Parseme Shared Task on Automatic Identification of VMWEs: Neural Multiword Expression Tagging with High Generalisation

arXiv:1809.03056v14 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of accurately identifying verbal multiword expressions for natural language processing applications, representing a strong specific gain in this domain.

The paper tackled the problem of multiword expression identification by developing a language-independent deep learning architecture, which achieved a macro-average MWE-based F1 score of 58.09 and outperformed all other systems in a shared task, with particular strength in generalizing to unseen data.

This paper presents a language-independent deep learning architecture adapted to the task of multiword expression (MWE) identification. We employ a neural architecture comprising of convolutional and recurrent layers with the addition of an optional CRF layer at the top. This system participated in the open track of the Parseme shared task on automatic identification of verbal MWEs due to the use of pre-trained wikipedia word embeddings. It outperformed all participating systems in both open and closed tracks with the overall macro-average MWE-based F1 score of 58.09 averaged among all languages. A particular strength of the system is its superior performance on unseen data entries.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes