CLJul 8, 2021

COMBO: a new module for EUD parsing

arXiv:2107.03809v1712 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work addresses parsing for multiple languages in computational linguistics, but it is incremental as it builds on existing methods.

The paper tackled parsing raw texts into Enhanced Universal Dependencies (EUD) for 17 languages, achieving an average ELAS of 83.79% and ranking fourth in the IWPT 2021 shared task.

We introduce the COMBO-based approach for EUD parsing and its implementation, which took part in the IWPT 2021 EUD shared task. The goal of this task is to parse raw texts in 17 languages into Enhanced Universal Dependencies (EUD). The proposed approach uses COMBO to predict UD trees and EUD graphs. These structures are then merged into the final EUD graphs. Some EUD edge labels are extended with case information using a single language-independent expansion rule. In the official evaluation, the solution ranked fourth, achieving an average ELAS of 83.79%. The source code is available at https://gitlab.clarin-pl.eu/syntactic-tools/combo.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes