CLAug 26, 2019

Does BERT agree? Evaluating knowledge of structure dependence through agreement relations

arXiv:1908.09892v122 citations
AI Analysis

This work addresses the problem of assessing syntactic knowledge in pre-trained models for NLP researchers, but it is incremental as it extends prior work on agreement phenomena.

The study evaluated BERT models' ability to capture structure-dependent agreement relations across 26 languages, finding they perform well overall but degrade in specific linguistic contexts.

Learning representations that accurately model semantics is an important goal of natural language processing research. Many semantic phenomena depend on syntactic structure. Recent work examines the extent to which state-of-the-art models for pre-training representations, such as BERT, capture such structure-dependent phenomena, but is largely restricted to one phenomenon in English: number agreement between subjects and verbs. We evaluate BERT's sensitivity to four types of structure-dependent agreement relations in a new semi-automatically curated dataset across 26 languages. We show that both the single-language and multilingual BERT models capture syntax-sensitive agreement patterns well in general, but we also highlight the specific linguistic contexts in which their performance degrades.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes