CLMar 24, 2021

Czert -- Czech BERT-like Model for Language Representation

arXiv:2103.13031v3661 citations
Originality Incremental advance
AI Analysis

This provides a high-performance tool for Czech NLP tasks, though it is incremental as it adapts existing architectures to a new language.

The authors tackled the lack of a Czech monolingual language representation model by training BERT and ALBERT models on 340K sentences, outperforming multilingual models on 9 out of 11 datasets and setting new state-of-the-art results on nine datasets.

This paper describes the training process of the first Czech monolingual language representation models based on BERT and ALBERT architectures. We pre-train our models on more than 340K of sentences, which is 50 times more than multilingual models that include Czech data. We outperform the multilingual models on 9 out of 11 datasets. In addition, we establish the new state-of-the-art results on nine datasets. At the end, we discuss properties of monolingual and multilingual models based upon our results. We publish all the pre-trained and fine-tuned models freely for the research community.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes