CLMay 4, 2020

From SPMRL to NMRL: What Did We Learn (and Unlearn) in a Decade of Parsing Morphologically-Rich Languages (MRLs)?

Reut Tsarfaty, Dan Bareket, Stav Klein, Amit Seker

arXiv:2005.01330v131.21004 citations

Originality Synthesis-oriented

AI Analysis

This work addresses parsing challenges for morphologically-rich languages, which is an incremental contribution to NLP for specific linguistic domains.

The paper reflects on a decade of parsing morphologically-rich languages (MRLs), highlighting challenges and solutions from the pre-neural era and arguing that similar issues re-emerge in neural architectures, with preliminary support from multi-tagging experiments in Hebrew.

It has been exactly a decade since the first establishment of SPMRL, a research initiative unifying multiple research efforts to address the peculiar challenges of Statistical Parsing for Morphologically-Rich Languages (MRLs).Here we reflect on parsing MRLs in that decade, highlight the solutions and lessons learned for the architectural, modeling and lexical challenges in the pre-neural era, and argue that similar challenges re-emerge in neural architectures for MRLs. We then aim to offer a climax, suggesting that incorporating symbolic ideas proposed in SPMRL terms into nowadays neural architectures has the potential to push NLP for MRLs to a new level. We sketch strategies for designing Neural Models for MRLs (NMRL), and showcase preliminary support for these strategies via investigating the task of multi-tagging in Hebrew, a morphologically-rich, high-fusion, language

View on arXiv PDF

Similar