CLMay 6, 2023

NorBench -- A Benchmark for Norwegian Language Models

arXiv:2305.03880v1258 citations
Originality Synthesis-oriented
AI Analysis

This addresses the problem of evaluating Norwegian LMs for researchers and practitioners, but it is incremental as it adapts existing benchmarking approaches to a specific language.

The authors tackled the lack of standardized evaluation for Norwegian language models by introducing NorBench, a benchmark suite with tasks and probes, and new models, resulting in performance comparisons across tests.

We present NorBench: a streamlined suite of NLP tasks and probes for evaluating Norwegian language models (LMs) on standardized data splits and evaluation metrics. We also introduce a range of new Norwegian language models (both encoder and encoder-decoder based). Finally, we compare and analyze their performance, along with other existing LMs, across the different benchmark tests of NorBench.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes