CL, Feb 1, 2023

Inference of Partial Colexifications from Multilingual Wordlists

arXiv:2302.00739v1, 8 citations, h-index: 31
Originality: Incremental advance
AI Analysis

The paper addresses a computational bottleneck in linguistics that affects researchers studying language evolution and typology, but the advance is incremental, since it builds on existing colexification research.

This study tackles the problem of inferring partial colexifications, which involve parts of words rather than entire words, from multilingual wordlists, by proposing new models, efficient methods, and workflows for analysis and visualization.

The past years have seen a drastic rise in studies devoted to the investigation of colexification patterns, both in individual language families in particular and in the languages of the world in general. Computational studies in particular have profited from the fact that colexification as a scientific construct is easy to operationalize, enabling scholars to infer colexification patterns for large collections of cross-linguistic data. Studies devoted to partial colexifications (colexification patterns that do not involve entire words, but rather various parts of words), however, have rarely been conducted so far. This is not surprising, since partial colexifications are harder to deal with in computational approaches and can easily suffer from noise resulting from false positive matches. In order to address this problem, this study proposes new approaches to the handling of partial colexifications by (1) proposing new models with which partial colexification patterns can be represented, (2) developing new efficient methods and workflows which help to infer various types of partial colexification patterns from multilingual wordlists, and (3) illustrating how inferred patterns of partial colexifications can be computationally analyzed and interactively visualized.
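
To make the idea of a partial colexification concrete, the sketch below shows one simple way such patterns could be searched for in a multilingual wordlist. It is an illustration under stated assumptions, not the method proposed in the paper: the function name `affix_colexifications`, the `min_length` threshold, and the toy wordlist are all hypothetical, and the matching is done on plain orthographic strings, whereas real wordlists would typically hold transcribed forms. A pair of concepts is reported when, within one language, the form for the first concept is a proper prefix or suffix of the form for the second.

```python
from collections import defaultdict
from itertools import permutations


def affix_colexifications(wordlist, min_length=3):
    """Find concept pairs whose forms stand in a part-whole relation.

    wordlist maps language -> {concept: form}. The result maps a directed
    pair (part_concept, whole_concept) to the sorted list of languages in
    which the first form is a proper prefix or suffix of the second.
    min_length is a hypothetical guard against false positives caused by
    very short forms.
    """
    links = defaultdict(set)
    for language, forms in wordlist.items():
        # Check every ordered pair of distinct concepts in this language.
        for (c1, f1), (c2, f2) in permutations(forms.items(), 2):
            # The candidate part must be long enough and strictly shorter
            # than the whole, so full colexifications are excluded.
            if len(f1) < min_length or len(f1) >= len(f2):
                continue
            if f2.startswith(f1) or f2.endswith(f1):
                links[(c1, c2)].add(language)
    return {pair: sorted(langs) for pair, langs in links.items()}


# Toy wordlist for illustration only (lowercased orthographic forms).
toy = {
    "English": {"FINGER": "finger", "FINGERNAIL": "fingernail", "HAND": "hand"},
    "German": {"FINGER": "finger", "FINGERNAIL": "fingernagel", "HAND": "hand"},
}

result = affix_colexifications(toy)
print(result)
```

Counting the languages behind each pair gives a rough measure of how widespread a pattern is. The length threshold is one crude way to limit the false positive matches mentioned above; any real workflow would need more careful filtering.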
