A new semantically annotated corpus with syntactic-semantic and cross-lingual senses
This provides a resource for French WSD, but is incremental as it extends existing annotation schemes to a new language and verb set.
The authors created a new sense-tagged corpus for word sense disambiguation, annotated with three sense labels for 20 French polysemous verbs, enabling cross-lingual and syntactic-semantic analysis.
We describe a new sense-tagged corpus for word sense disambiguation. The corpus is constituted of instances of 20 French polysemous verbs. Each verb instance is annotated with three sense labels: (1) the actual translation of the verb in the english version of this instance in a parallel corpus, (2) an entry of the verb in a computational dictionary of French (the Lexicon-Grammar tables) and (3) a fine-grained sense label resulting from the concatenation of the translation and the Lexicon-Grammar entry.