CLApr 8, 2024

ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic Analysis of Latin

arXiv:2404.05839v25 citationsh-index: 4Has CodeLT4HALA
Originality Synthesis-oriented
AI Analysis

This addresses the problem of automated linguistic analysis for Latin texts, though it appears incremental as it builds on existing pre-trained models and methods.

The authors tackled morphosyntactic analysis of Latin by developing LatinPipe, a system that jointly learns dependency parsing and morphological analysis, which won the EvaLatin 2024 Dependency Parsing shared task.

We present LatinPipe, the winning submission to the EvaLatin 2024 Dependency Parsing shared task. Our system consists of a fine-tuned concatenation of base and large pre-trained LMs, with a dot-product attention head for parsing and softmax classification heads for morphology to jointly learn both dependency parsing and morphological analysis. It is trained by sampling from seven publicly available Latin corpora, utilizing additional harmonization of annotations to achieve a more unified annotation style. Before fine-tuning, we train the system for a few initial epochs with frozen weights. We also add additional local relative contextualization by stacking the BiLSTM layers on top of the Transformer(s). Finally, we ensemble output probability distributions from seven randomly instantiated networks for the final submission. The code is available at https://github.com/ufal/evalatin2024-latinpipe.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes