CLAISep 24, 2025

Morphological Synthesizer for Ge'ez Language: Addressing Morphological Complexity and Resource Limitations

arXiv:2509.20341v181 citationsh-index: 2RAIL
Originality Incremental advance
AI Analysis

This addresses resource limitations for researchers and communities studying Ethiopian and Eritrean cultural heritage, though it is incremental as it focuses on a specific tool for a low-resource language.

The paper tackled the lack of NLP tools for Ge'ez, a linguistically complex language, by developing a rule-based morphological synthesizer that achieved 97.4% accuracy in generating surface words from roots.

Ge'ez is an ancient Semitic language renowned for its unique alphabet. It serves as the script for numerous languages, including Tigrinya and Amharic, and played a pivotal role in Ethiopia's cultural and religious development during the Aksumite kingdom era. Ge'ez remains significant as a liturgical language in Ethiopia and Eritrea, with much of the national identity documentation recorded in Ge'ez. These written materials are invaluable primary sources for studying Ethiopian and Eritrean philosophy, creativity, knowledge, and civilization. Ge'ez has a complex morphological structure with rich inflectional and derivational morphology, and no usable NLP has been developed and published until now due to the scarcity of annotated linguistic data, corpora, labeled datasets, and lexicons. Therefore, we propose a rule-based Ge'ez morphological synthesizer to generate surface words from root words according to the morphological structures of the language. We used 1,102 sample verbs, representing all verb morphological structures, to test and evaluate the system. The system achieves a performance of 97.4%, outperforming the baseline model and suggesting that future work should build a comprehensive system considering morphological variations of the language. Keywords: Ge'ez, NLP, morphology, morphological synthesizer, rule-based

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes