CL IRMar 31, 2017

Opinion Mining on Non-English Short Text

arXiv:1704.00016v20.3

Originality Incremental advance

AI Analysis

This work addresses sentiment analysis for languages with limited resources, such as Turkish, by enabling detection of mixed sentiments on a multi-variant scale.

The paper tackled opinion mining on non-English short texts by proposing a method that projects text into dense, low-dimensional feature vectors based on sentiment strength, achieving good results on Turkish tweets.

As the type and the number of such venues increase, automated analysis of sentiment on textual resources has become an essential data mining task. In this paper, we investigate the problem of mining opinions on the collection of informal short texts. Both positive and negative sentiment strength of texts are detected. We focus on a non-English language that has few resources for text mining. This approach would help enhance the sentiment analysis in languages where a list of opinionated words does not exist. We propose a new method projects the text into dense and low dimensional feature vectors according to the sentiment strength of the words. We detect the mixture of positive and negative sentiments on a multi-variant scale. Empirical evaluation of the proposed framework on Turkish tweets shows that our approach gets good results for opinion mining.

View on arXiv PDF

Similar