CLMar 30, 2021

Collaborative construction of lexicographic and parallel datasets for African languages: first assessment

arXiv:2103.16712v1Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the resource scarcity problem for NLP practitioners working with African languages, but it is incremental as it focuses on dataset creation rather than novel methods.

The researchers tackled the lack of resources for African languages in NLP and AI by building open-source platforms for collaborative lexicographic data construction, reporting initial results after 2 years of work.

Faced with a considerable lack of resources in African languages to carry out work in Natural Language Processing (NLP), Natural Language Understanding (NLU) and artificial intelligence, the research teams of NTeALan association has set itself the objective of building open-source platforms for the collaborative construction of lexicographic data in African languages. In this article, we present our first reports after 2 years of collaborative construction of lexicographic resources useful for African NLP tools.

View on arXiv PDF

Similar