ROOT13: Spotting Hypernyms, Co-Hyponyms and Randoms
This work addresses a specific NLP task for lexical semantics, offering incremental improvements over existing methods.
The paper tackles the problem of classifying word pairs into hypernyms, co-hyponyms, and random words using a supervised system called ROOT13, which achieves an F1 score of 88.3% in a three-class setting and up to 97.3% in binary classifications, outperforming baselines by significant margins.
In this paper, we describe ROOT13, a supervised system for the classification of hypernyms, co-hyponyms and random words. The system relies on a Random Forest algorithm and 13 unsupervised corpus-based features. We evaluate it with a 10-fold cross validation on 9,600 pairs, equally distributed among the three classes and involving several Parts-Of-Speech (i.e. adjectives, nouns and verbs). When all the classes are present, ROOT13 achieves an F1 score of 88.3%, against a baseline of 57.6% (vector cosine). When the classification is binary, ROOT13 achieves the following results: hypernyms-co-hyponyms (93.4% vs. 60.2%), hypernymsrandom (92.3% vs. 65.5%) and co-hyponyms-random (97.3% vs. 81.5%). Our results are competitive with stateof-the-art models.