SlovakBERT: Slovak Masked Language Model
This work addresses the lack of Slovak language models available to NLP practitioners, although the contribution is incremental: it applies existing methods to a new language.
The authors introduce SlovakBERT, to their knowledge the first transformer-based language model for Slovak. The model achieves state-of-the-art results on several NLP tasks, and its evaluation is a first attempt to establish a benchmark for Slovak language models.
We introduce a new Slovak masked language model called SlovakBERT. To the best of our knowledge, this is the first paper discussing transformer-based language models for Slovak. We evaluate our model on several NLP tasks and achieve state-of-the-art results. This evaluation is likewise the first attempt to establish a benchmark for Slovak language models. We publish the masked language model, as well as the fine-tuned models for part-of-speech tagging, sentiment analysis, and semantic textual similarity.