CLMay 30
French parsing enhanced with a word clustering method based on a syntactic lexiconAnthony Sigogne, Matthieu Constant, Eric Laporte
This article evaluates the integration of data extracted from a French syntactic lexicon, the Lexicon-Grammar (Gross, 1994), into a probabilistic parser. We show that by applying clustering methods on verbs of the French Treebank (Abeillé et al., 2003), we obtain accurate performances on French with a parser based on a Probabilistic Context-Free Grammar (Petrov et al., 2006).
CLMay 27
A new semantically annotated corpus with syntactic-semantic and cross-lingual sensesMyriam Rakho, Eric Laporte, Matthieu Constant
We describe a new sense-tagged corpus for word sense disambiguation. The corpus is constituted of instances of 20 French polysemous verbs. Each verb instance is annotated with three sense labels: (1) the actual translation of the verb in the english version of this instance in a parallel corpus, (2) an entry of the verb in a computational dictionary of French (the Lexicon-Grammar tables) and (3) a fine-grained sense label resulting from the concatenation of the translation and the Lexicon-Grammar entry.
CLApr 7, 2014
Intégration des données d'un lexique syntaxique dans un analyseur syntaxique probabilisteAnthony Sigogne, Matthieu Constant, Eric Laporte
This article reports the evaluation of the integration of data from a syntactic-semantic lexicon, the Lexicon-Grammar of French, into a syntactic parser. We show that by changing the set of labels for verbs and predicational nouns, we can improve the performance on French of a non-lexicalized probabilistic parser.