Classifying several dialectal Nawatl varieties
This work addresses a domain-specific problem for linguists and communities preserving indigenous languages, but it appears incremental as it applies existing methods to a new dataset.
The researchers tackled the problem of classifying Nawatl dialectal varieties, which has limited computational resources, by applying machine learning and neural networks, but no concrete results or numbers are provided in the abstract.
Mexico is a country with a large number of indigenous languages, among which the most widely spoken is Nawatl, with more than two million people currently speaking it (mainly in North and Central America). Despite its rich cultural heritage, which dates back to the 15th century, Nawatl is a language with few computer resources. The problem is compounded when it comes to its dialectal varieties, with approximately 30 varieties recognised, not counting the different spellings in the written forms of the language. In this research work, we addressed the problem of classifying Nawatl varieties using Machine Learning and Neural Networks.