Annotating Compositionality Scores for Irish Noun Compounds is Hard Work
This study addresses the problem of noun compound annotation for the Irish language, which is significant for NLP applications in this domain.
The authors analyzed Irish noun compounds and found that annotating compositionality scores is challenging, with variability in idiomaticity and interpretation. The study contributed to a greater understanding of how these constructions appear in the Irish language.
Noun compounds constitute a challenging construction for NLP applications, given their variability in idiomaticity and interpretation. In this paper, we present an analysis of compound nouns identified in Irish text of varied domains by expert annotators, focusing on compositionality as a key feature, but also domain specificity, as well as familiarity and confidence of the annotator giving the ratings. Our findings and the discussion that ensued contributes towards a greater understanding of how these constructions appear in Irish language, and how they might be treated separately from English noun compounds.