CL, Jun 28, 2024

Le sens de la famille : analyse du vocabulaire de la parenté par les plongements de mots

arXiv:2406.19729v1
Originality: Synthesis-oriented
AI Analysis

The paper offers a computational linguistics method for analyzing structured lexical domains in French. Its contribution is incremental: it applies existing techniques to new data.

The study analyzes the vocabulary of French family relationships through distributional analysis of a 25-noun lexicon. It shows that distributional information captures organizing features such as descent, alliance, siblings, and gender, with variation across the corpora compared.

In this study, we propose a corpus analysis of an area of the French lexicon that is both dense and highly structured: the vocabulary of family relationships. Starting with a lexicon of 25 nouns designating the main relationships (son, cousin, mother, grandfather, sister-in-law, etc.), we examine how these terms are positioned in relation to each other through distributional analyses based on the use of these terms in corpora. We show that distributional information can capture certain features that organize this vocabulary (descent, alliance, siblings, gender), in ways that vary according to the different corpora compared.
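As a rough illustration of what "positioning terms in relation to each other" can mean with word embeddings, the sketch below ranks kinship nouns by cosine similarity. It is not the authors' pipeline: the vectors are hypothetical toy values, the four dimensions and their labels are invented for readability, and the helper names `cosine` and `nearest_neighbours` are assumptions. Real embeddings would be trained on a corpus and would not have interpretable axes.

```python
import math

# Hypothetical toy vectors, for illustration only. The axes are made up:
# [descent, collateral, alliance, female]. Trained embeddings have hundreds
# of dimensions with no such direct interpretation.
TOY_VECTORS = {
    "fils":   [1.0, 0.0, 0.0, 0.0],
    "fille":  [1.0, 0.0, 0.0, 1.0],
    "oncle":  [0.0, 1.0, 0.0, 0.0],
    "tante":  [0.0, 1.0, 0.0, 1.0],
    "gendre": [1.0, 0.0, 1.0, 0.0],
}


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def nearest_neighbours(word, vectors, k=2):
    """Return the k terms most similar to `word`, best match first."""
    scores = [
        (other, cosine(vectors[word], vec))
        for other, vec in vectors.items()
        if other != word
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:k]


if __name__ == "__main__":
    for term, score in nearest_neighbours("tante", TOY_VECTORS):
        print(f"{term}\t{score:.3f}")
```

Running the same neighbour ranking over vectors trained on different corpora is one simple way to observe the kind of cross-corpus variation the abstract mentions.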
