CLSep 11, 2022

Probing for Understanding of English Verb Classes and Alternations in Large Pre-trained Language Models

David K. Yi, James V. Bruno, Jiayu Han, Peter Zukerman, Shane Steinert-Threlkeld

arXiv:2209.04811v123.9291 citationsh-index: 14

Originality Synthesis-oriented

AI Analysis

This provides insights into linguistic knowledge in PLMs for NLP researchers, but it is incremental as it builds on prior probing work.

The study investigated whether large pre-trained language models encode English verb alternation classes, finding that contextual embeddings achieve high accuracies on probing tasks and that middle-to-upper layers perform better than lower layers.

We investigate the extent to which verb alternation classes, as described by Levin (1993), are encoded in the embeddings of Large Pre-trained Language Models (PLMs) such as BERT, RoBERTa, ELECTRA, and DeBERTa using selectively constructed diagnostic classifiers for word and sentence-level prediction tasks. We follow and expand upon the experiments of Kann et al. (2019), which aim to probe whether static embeddings encode frame-selectional properties of verbs. At both the word and sentence level, we find that contextual embeddings from PLMs not only outperform non-contextual embeddings, but achieve astonishingly high accuracies on tasks across most alternation classes. Additionally, we find evidence that the middle-to-upper layers of PLMs achieve better performance on average than the lower layers across all probing tasks.

View on arXiv PDF

Similar