CLMay 27

A new semantically annotated corpus with syntactic-semantic and cross-lingual senses

arXiv:2605.2849475.21 citations
AI Analysis

This provides a resource for French WSD, but is incremental as it extends existing annotation schemes to a new language and verb set.

The authors created a new sense-tagged corpus for word sense disambiguation, annotated with three sense labels for 20 French polysemous verbs, enabling cross-lingual and syntactic-semantic analysis.

We describe a new sense-tagged corpus for word sense disambiguation. The corpus is constituted of instances of 20 French polysemous verbs. Each verb instance is annotated with three sense labels: (1) the actual translation of the verb in the english version of this instance in a parallel corpus, (2) an entry of the verb in a computational dictionary of French (the Lexicon-Grammar tables) and (3) a fine-grained sense label resulting from the concatenation of the translation and the Lexicon-Grammar entry.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes