CL · Dec 18, 2020

An Empirical Study of Using Pre-trained BERT Models for Vietnamese Relation Extraction Task at VLSP 2020

arXiv:2012.10275v2821 citations

Originality: Synthesis-oriented

AI Analysis

This work provides an incremental improvement for Vietnamese relation extraction by comparing and combining existing BERT-based models.

This paper empirically studies pre-trained BERT models for Vietnamese relation extraction, finding that the NlpHUST/vibert4news model significantly outperforms FPTAI/vibert. An ensemble model combining R-BERT and BERT with entity starts showed slight improvements over single models.

In this paper, we present an empirical study of using pre-trained BERT models for the relation extraction task at the VLSP 2020 Evaluation Campaign. We applied two state-of-the-art BERT-based models: R-BERT and the BERT model with entity starts. For each model, we compared two pre-trained BERT models: FPTAI/vibert and NlpHUST/vibert4news. We found that the NlpHUST/vibert4news model significantly outperforms FPTAI/vibert on the Vietnamese relation extraction task. Finally, we proposed an ensemble model that combines R-BERT and BERT with entity starts. Our proposed ensemble model slightly outperformed both single models on the development data and the test data provided by the task organizers.
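The abstract does not say how the outputs of R-BERT and BERT with entity starts are combined. One common way to ensemble two classifiers, assumed here purely for illustration, is to average their per-class probabilities and pick the highest-scoring relation label. The function names, the `weight_a` parameter, and the toy logits below are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Tuple


def softmax(logits: List[float]) -> List[float]:
    """Convert raw scores to probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_predict(
    logits_a: List[float],
    logits_b: List[float],
    weight_a: float = 0.5,
) -> Tuple[int, List[float]]:
    """Weighted average of two models' class probabilities.

    Assumption: both models score the same relation labels in the same order.
    Returns the index of the winning label and the combined probabilities.
    """
    if len(logits_a) != len(logits_b):
        raise ValueError("both models must score the same set of labels")
    probs_a = softmax(logits_a)
    probs_b = softmax(logits_b)
    combined = [
        weight_a * pa + (1.0 - weight_a) * pb
        for pa, pb in zip(probs_a, probs_b)
    ]
    best = max(range(len(combined)), key=combined.__getitem__)
    return best, combined


# Toy example: hypothetical logits over three relation labels.
label, probs = ensemble_predict([2.0, 0.0, 0.0], [0.0, 1.0, 0.0])
print(label, [round(p, 3) for p in probs])
```

With equal weights, a confident prediction from one model can outweigh a weaker disagreement from the other, which is one plausible reason an ensemble might edge out either single model.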
