CLOct 6, 2020

Intrinsic Probing through Dimension Selection

arXiv:2010.02812v11012 citations
Originality Incremental advance
AI Analysis

This addresses the problem of understanding linguistic structure in NLP representations for researchers, offering a more direct probing method, though it is incremental in refining existing probing techniques.

The paper distinguishes intrinsic from extrinsic probing and introduces a decomposable multivariate Gaussian probe to assess whether linguistic information in word embeddings is dispersed or focal, finding that most morphosyntactic attributes are encoded by few neurons, with fastText concentrating structure more than BERT across 36 languages.

Most modern NLP systems make use of pre-trained contextual representations that attain astonishingly high performance on a variety of tasks. Such high performance should not be possible unless some form of linguistic structure inheres in these representations, and a wealth of research has sprung up on probing for it. In this paper, we draw a distinction between intrinsic probing, which examines how linguistic information is structured within a representation, and the extrinsic probing popular in prior work, which only argues for the presence of such information by showing that it can be successfully extracted. To enable intrinsic probing, we propose a novel framework based on a decomposable multivariate Gaussian probe that allows us to determine whether the linguistic information in word embeddings is dispersed or focal. We then probe fastText and BERT for various morphosyntactic attributes across 36 languages. We find that most attributes are reliably encoded by only a few neurons, with fastText concentrating its linguistic structure more than BERT.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes