Prior Polarity Lexical Resources for the Italian Language
The resource presented here provides a foundational tool for opinion mining and sentiment analysis in Italian, filling a gap for researchers and practitioners in natural language processing.
The paper addresses the lack of prior polarity lexical resources for Italian by presenting SABRINA, a manually annotated resource comprising a dictionary of more than 277,000 words tagged with prior polarity values and a set of more than 200 polarity modifiers. To the best of the authors' knowledge, it is the first such resource for Italian.
In this paper we present SABRINA (Sentiment Analysis: a Broad Resource for Italian Natural language Applications), a manually annotated prior polarity lexical resource for Italian natural language applications in the field of opinion mining and sentiment induction. The resource consists of two sets: an Italian dictionary of more than 277,000 words tagged with their prior polarity value, and a set of more than 200 polarity modifiers, which can be combined with the non-neutral terms of the dictionary to induce the sentiment of Italian compound terms. To the best of our knowledge, this is the first manually annotated prior polarity resource developed for the Italian language.
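The combination of modifiers with non-neutral dictionary terms can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the entries are toy examples rather than data taken from SABRINA, the modifier types "invert" and "intensify" are hypothetical labels, and the names PRIOR_POLARITY, MODIFIERS, and compound_polarity do not come from the paper. The abstract does not specify how the resource encodes polarity values or how modifiers are applied, so this only shows one plausible scheme.

```python
# Toy prior polarity dictionary: +1 positive, -1 negative, 0 neutral.
# Illustrative entries only; NOT taken from SABRINA.
PRIOR_POLARITY = {
    "buono": 1,     # "good"
    "cattivo": -1,  # "bad"
    "tavolo": 0,    # "table"
}

# Toy polarity modifiers with hypothetical types (an assumption, not the
# paper's scheme): "invert" flips the sign, "intensify" preserves it.
MODIFIERS = {
    "non": "invert",       # "not"
    "molto": "intensify",  # "very"
}


def compound_polarity(tokens):
    """Return the polarity sign (-1, 0, or +1) of a short compound term.

    Modifiers seen before the first non-neutral dictionary term are applied
    to that term. Neutral and unknown words are skipped.
    """
    flip = False
    for token in tokens:
        word = token.lower()
        kind = MODIFIERS.get(word)
        if kind == "invert":
            flip = not flip
        elif kind is None:
            prior = PRIOR_POLARITY.get(word, 0)
            if prior != 0:
                return -prior if flip else prior
    # No non-neutral term found: the compound stays neutral.
    return 0
```

Under these assumptions, compound_polarity(["non", "buono"]) yields a negative sign, while compound_polarity(["molto", "buono"]) stays positive. A fuller treatment would also need polarity strength and modifiers that follow the term they affect, neither of which this sketch models.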