CLSep 30, 2021

SlovakBERT: Slovak Masked Language Model

arXiv:2109.15254v2291 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the lack of Slovak language models for NLP practitioners, though it is incremental as it applies existing methods to a new language.

The authors introduced SlovakBERT, the first transformer-based language model for Slovak, and achieved state-of-the-art results on several NLP tasks, establishing the first benchmark for Slovak language models.

We introduce a new Slovak masked language model called SlovakBERT. This is to our best knowledge the first paper discussing Slovak transformers-based language models. We evaluate our model on several NLP tasks and achieve state-of-the-art results. This evaluation is likewise the first attempt to establish a benchmark for Slovak language models. We publish the masked language model, as well as the fine-tuned models for part-of-speech tagging, sentiment analysis and semantic textual similarity.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes