CLSep 23, 2021
Corpus and Models for Lemmatisation and POS-tagging of Old FrenchJean-Baptiste Camps, Thibault Clérice, Frédéric Duval et al.
Old French is a typical example of an under-resourced historic languages, that furtherly displays animportant amount of linguistic variation. In this paper, we present the current results of a long going project (2015-...) and describe how we broached the difficult question of providing lemmatisation andPOS models for Old French with the help of neural taggers and the progressive constitution of dedicated corpora.