CL LGApr 22, 2024

What do Transformers Know about Government?

Jue Hou, Anisia Katinskaia, Lari Kotilainen, Sathianpong Trangcasanchai, Anh-Duc Vu, Roman Yangarber

arXiv:2404.14270v123.982 citationsh-index: 27Has CodeLREC

Originality Incremental advance

AI Analysis

This addresses a lack of data for researchers studying grammatical constructions, particularly government relations, by providing a new dataset.

The paper investigates how BERT encodes government relations in sentences, finding that this information is present across all layers, especially early ones, and that a few attention heads enable a classifier to discover new types of government not seen in training.

This paper investigates what insights about linguistic features and what knowledge about the structure of natural language can be obtained from the encodings in transformer language models.In particular, we explore how BERT encodes the government relation between constituents in a sentence. We use several probing classifiers, and data from two morphologically rich languages. Our experiments show that information about government is encoded across all transformer layers, but predominantly in the early layers of the model. We find that, for both languages, a small number of attention heads encode enough information about the government relations to enable us to train a classifier capable of discovering new, previously unknown types of government, never seen in the training data. Currently, data is lacking for the research community working on grammatical constructions, and government in particular. We release the Government Bank -- a dataset defining the government relations for thousands of lemmas in the languages in our experiments.

View on arXiv PDF Code

Similar