Towards Syntactic Iberian Polarity Classification
This work addresses the problem of language dependency in polarity classification for researchers and practitioners in multilingual NLP, though it is incremental as it extends existing methods to new languages.
The authors tackled the challenge of adapting lexicon-based polarity classification methods across multiple languages by developing the first symbolic syntax-based system for Iberian languages, which supports Basque, Catalan, Galician, Portuguese, and Spanish with shared rules and is publicly available.
Lexicon-based methods using syntactic rules for polarity classification rely on parsers that are dependent on the language and on treebank guidelines. Thus, rules are also dependent and require adaptation, especially in multilingual scenarios. We tackle this challenge in the context of the Iberian Peninsula, releasing the first symbolic syntax-based Iberian system with rules shared across five official languages: Basque, Catalan, Galician, Portuguese and Spanish. The model is made available.