CLNov 8, 2024

Word reuse and combination support efficient communication of emerging concepts

arXiv:2411.05379v14 citationsh-index: 3PNAS
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of understanding lexicalization processes in linguistics, offering a unified account that bridges word meaning extension and word formation, though it is incremental in applying information theory to existing strategies.

The study tackled the problem of how languages efficiently lexicalize emerging concepts through word reuse and combination, finding that historically emerging items in English, French, and Finnish achieve higher communicative efficiency than hypothetical alternatives, with literal forms being more efficient than non-literal ones.

A key function of the lexicon is to express novel concepts as they emerge over time through a process known as lexicalization. The most common lexicalization strategies are the reuse and combination of existing words, but they have typically been studied separately in the areas of word meaning extension and word formation. Here we offer an information-theoretic account of how both strategies are constrained by a fundamental tradeoff between competing communicative pressures: word reuse tends to preserve the average length of word forms at the cost of less precision, while word combination tends to produce more informative words at the expense of greater word length. We test our proposal against a large dataset of reuse items and compounds that appeared in English, French and Finnish over the past century. We find that these historically emerging items achieve higher levels of communicative efficiency than hypothetical ways of constructing the lexicon, and both literal reuse items and compounds tend to be more efficient than their non-literal counterparts. These results suggest that reuse and combination are both consistent with a unified account of lexicalization grounded in the theory of efficient communication.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes