CLLGJun 13, 2020

Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya

arXiv:2006.07698v218 citations
AI Analysis

This work addresses the challenge of applying NLP to low-resource languages like Tigrinya, offering a cost-effective solution that is incremental in nature.

The paper tackles the problem of high pre-training costs for transformer models in low-resource languages by proposing a transfer learning method to adapt a monolingual English XLNet model to Tigrinya, achieving a 78.88% F1-score on a sentiment analysis dataset with only 10k examples, outperforming BERT and mBERT by 10% and 7% respectively.

In recent years, transformer models have achieved great success in natural language processing (NLP) tasks. Most of the current state-of-the-art NLP results are achieved by using monolingual transformer models, where the model is pre-trained using a single language unlabelled text corpus. Then, the model is fine-tuned to the specific downstream task. However, the cost of pre-training a new transformer model is high for most languages. In this work, we propose a cost-effective transfer learning method to adopt a strong source language model, trained from a large monolingual corpus to a low-resource language. Thus, using XLNet language model, we demonstrate competitive performance with mBERT and a pre-trained target language model on the cross-lingual sentiment (CLS) dataset and on a new sentiment analysis dataset for low-resourced language Tigrinya. With only 10k examples of the given Tigrinya sentiment analysis dataset, English XLNet has achieved 78.88% F1-Score outperforming BERT and mBERT by 10% and 7%, respectively. More interestingly, fine-tuning (English) XLNet model on the CLS dataset has promising results compared to mBERT and even outperformed mBERT for one dataset of the Japanese language.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes