CLApr 10, 2020

Overestimation of Syntactic Representationin Neural Language Models

arXiv:2004.05067v113 citations
AI Analysis

This is an incremental critique that challenges the validity of existing testing methodologies for researchers studying language model representations.

The paper identifies a fundamental problem in a popular method for probing syntactic representations in neural language models, showing that positive results can be reproduced using non-syntactic baselines like an n-gram model and an LSTM trained on scrambled inputs.

With the advent of powerful neural language models over the last few years, research attention has increasingly focused on what aspects of language they represent that make them so successful. Several testing methodologies have been developed to probe models' syntactic representations. One popular method for determining a model's ability to induce syntactic structure trains a model on strings generated according to a template then tests the model's ability to distinguish such strings from superficially similar ones with different syntax. We illustrate a fundamental problem with this approach by reproducing positive results from a recent paper with two non-syntactic baseline language models: an n-gram model and an LSTM model trained on scrambled inputs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes