COMBO: a new module for EUD parsing
This work addresses parsing for multiple languages in computational linguistics, but it is incremental as it builds on existing methods.
The paper tackled parsing raw texts into Enhanced Universal Dependencies (EUD) for 17 languages, achieving an average ELAS of 83.79% and ranking fourth in the IWPT 2021 shared task.
We introduce the COMBO-based approach for EUD parsing and its implementation, which took part in the IWPT 2021 EUD shared task. The goal of this task is to parse raw texts in 17 languages into Enhanced Universal Dependencies (EUD). The proposed approach uses COMBO to predict UD trees and EUD graphs. These structures are then merged into the final EUD graphs. Some EUD edge labels are extended with case information using a single language-independent expansion rule. In the official evaluation, the solution ranked fourth, achieving an average ELAS of 83.79%. The source code is available at https://gitlab.clarin-pl.eu/syntactic-tools/combo.