CLAINov 3, 2023

Abstraction via exemplars? A representational case study on lexical category inference in BERT

arXiv:2312.03708v15 citationsh-index: 21
Originality Incremental advance
AI Analysis

This addresses the debate on abstraction vs. exemplar-based learning in language acquisition for computational linguistics, though it is incremental as it adapts an existing experiment.

The study investigated whether neural language models like BERT can develop abstraction-like behavior from exemplar-based learning, finding that BERT generalizes novel token usage by moving their representations toward known category exemplars in a two-dimensional space.

Exemplar based accounts are often considered to be in direct opposition to pure linguistic abstraction in explaining language learners' ability to generalize to novel expressions. However, the recent success of neural network language models on linguistically sensitive tasks suggests that perhaps abstractions can arise via the encoding of exemplars. We provide empirical evidence for this claim by adapting an existing experiment that studies how an LM (BERT) generalizes the usage of novel tokens that belong to lexical categories such as Noun/Verb/Adjective/Adverb from exposure to only a single instance of their usage. We analyze the representational behavior of the novel tokens in these experiments, and find that BERT's capacity to generalize to unseen expressions involving the use of these novel tokens constitutes the movement of novel token representations towards regions of known category exemplars in two-dimensional space. Our results suggest that learners' encoding of exemplars can indeed give rise to abstraction like behavior.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes