CLSOC-PHMar 30, 2017

Neutral evolution and turnover over centuries of English word popularity

arXiv:1703.10698v12 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of understanding cultural evolution and vocabulary dynamics for linguists and computational social scientists, though it is incremental as it builds on existing neutral models.

The researchers tackled the problem of modeling the evolution of English word popularity over centuries by testing neutral models against historical corpus data, finding that a modified two-stage neutral model successfully replicates both static and dynamic properties of the data, unlike a commonly used model.

Here we test Neutral models against the evolution of English word frequency and vocabulary at the population scale, as recorded in annual word frequencies from three centuries of English language books. Against these data, we test both static and dynamic predictions of two neutral models, including the relation between corpus size and vocabulary size, frequency distributions, and turnover within those frequency distributions. Although a commonly used Neutral model fails to replicate all these emergent properties at once, we find that modified two-stage Neutral model does replicate the static and dynamic properties of the corpus data. This two-stage model is meant to represent a relatively small corpus (population) of English books, analogous to a `canon', sampled by an exponentially increasing corpus of books in the wider population of authors. More broadly, this mode -- a smaller neutral model within a larger neutral model -- could represent more broadly those situations where mass attention is focused on a small subset of the cultural variants.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes