CLApr 30, 2020

An Empirical Study of Pre-trained Transformers for Arabic Information Extraction

arXiv:2004.14519v51004 citationsHas Code
AI Analysis

This work addresses the need for better Arabic NLP tools, particularly for information extraction, but is incremental as it builds on existing BERT architectures with domain-specific customization.

The paper tackled the problem of limited study on multilingual pre-trained Transformers for Arabic information extraction by pre-training a customized bilingual BERT model, GigaBERT, which significantly outperformed existing models like mBERT, XLM-RoBERTa, and AraBERT in supervised and zero-shot transfer settings across four IE tasks.

Multilingual pre-trained Transformers, such as mBERT (Devlin et al., 2019) and XLM-RoBERTa (Conneau et al., 2020a), have been shown to enable the effective cross-lingual zero-shot transfer. However, their performance on Arabic information extraction (IE) tasks is not very well studied. In this paper, we pre-train a customized bilingual BERT, dubbed GigaBERT, that is designed specifically for Arabic NLP and English-to-Arabic zero-shot transfer learning. We study GigaBERT's effectiveness on zero-short transfer across four IE tasks: named entity recognition, part-of-speech tagging, argument role labeling, and relation extraction. Our best model significantly outperforms mBERT, XLM-RoBERTa, and AraBERT (Antoun et al., 2020) in both the supervised and zero-shot transfer settings. We have made our pre-trained models publicly available at https://github.com/lanwuwei/GigaBERT.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes