CLJan 21, 2020

Where New Words Are Born: Distributional Semantic Analysis of Neologisms and Their Semantic Neighborhoods

arXiv:2001.07740v11000 citations
AI Analysis

This work addresses the linguistic question of how language-internal and external factors influence language change, though it is incremental as it applies existing methods to a new domain.

The study tackled the problem of predicting new word emergence in English by analyzing semantic sparsity and frequency growth rates of neighbors using distributional semantics, finding that both factors are predictive but with stronger support for frequency growth.

We perform statistical analysis of the phenomenon of neology, the process by which new words emerge in a language, using large diachronic corpora of English. We investigate the importance of two factors, semantic sparsity and frequency growth rates of semantic neighbors, formalized in the distributional semantics paradigm. We show that both factors are predictive of word emergence although we find more support for the latter hypothesis. Besides presenting a new linguistic application of distributional semantics, this study tackles the linguistic question of the role of language-internal factors (in our case, sparsity) in language change motivated by language-external factors (reflected in frequency growth).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes