CLDec 18, 2020

An Empirical Study of Using Pre-trained BERT Models for Vietnamese Relation Extraction Task at VLSP 2020

arXiv:2012.10275v2821 citations
Originality Synthesis-oriented
AI Analysis

This work provides an incremental improvement for Vietnamese relation extraction by comparing and combining existing BERT-based models.

This paper empirically studies pre-trained BERT models for Vietnamese relation extraction, finding that the NlpHUST/vibert4news model significantly outperforms FPTAI/vibert. An ensemble model combining R-BERT and BERT with entity starts showed slight improvements over single models.

In this paper, we present an empirical study of using pre-trained BERT models for the relation extraction task at the VLSP 2020 Evaluation Campaign. We applied two state-of-the-art BERT-based models: R-BERT and BERT model with entity starts. For each model, we compared two pre-trained BERT models: FPTAI/vibert and NlpHUST/vibert4news. We found that NlpHUST/vibert4news model significantly outperforms FPTAI/vibert for the Vietnamese relation extraction task. Finally, we proposed an ensemble model that combines R-BERT and BERT with entity starts. Our proposed ensemble model slightly improved against two single models on the development data and the test data provided by the task organizers.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes