CLLGApr 22, 2024

What do Transformers Know about Government?

arXiv:2404.14270v182 citationsh-index: 27LREC
Originality Incremental advance
AI Analysis

This addresses a lack of data for researchers studying grammatical constructions, particularly government relations, by providing a new dataset.

The paper investigates how BERT encodes government relations in sentences, finding that this information is present across all layers, especially early ones, and that a few attention heads enable a classifier to discover new types of government not seen in training.

This paper investigates what insights about linguistic features and what knowledge about the structure of natural language can be obtained from the encodings in transformer language models.In particular, we explore how BERT encodes the government relation between constituents in a sentence. We use several probing classifiers, and data from two morphologically rich languages. Our experiments show that information about government is encoded across all transformer layers, but predominantly in the early layers of the model. We find that, for both languages, a small number of attention heads encode enough information about the government relations to enable us to train a classifier capable of discovering new, previously unknown types of government, never seen in the training data. Currently, data is lacking for the research community working on grammatical constructions, and government in particular. We release the Government Bank -- a dataset defining the government relations for thousands of lemmas in the languages in our experiments.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes