CL AI SI | Nov 19, 2015

Overcoming Language Variation in Sentiment Analysis with Social Attention

arXiv:1511.06052v422.193 citations | Has Code

Originality: Incremental advance

AI Analysis

This work addresses sentiment analysis robustness for social media and review platforms, offering a method to handle author-specific language variations without labeled data, though it is incremental in applying social network insights to existing models.

The paper tackles language variation in sentiment analysis by exploiting social networks and linguistic homophily, proposing an attention-based neural network that significantly improves accuracy on Twitter and review data.

Variation in language is ubiquitous, particularly in newer forms of writing such as social media. Fortunately, variation is not random; it is often linked to social properties of the author. In this paper, we show how to exploit social networks to make sentiment analysis more robust to social language variation. The key idea is linguistic homophily: the tendency of socially linked individuals to use language in similar ways. We formalize this idea in a novel attention-based neural network architecture, in which attention is divided among several basis models, depending on the author's position in the social network. This has the effect of smoothing the classification function across the social network, and makes it possible to induce personalized classifiers even for authors for whom there is no labeled data or demographic metadata. This model significantly improves the accuracies of sentiment analysis on Twitter and on review data.
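
The idea of dividing attention among several basis models can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes each basis model is a plain linear softmax classifier, that each author already has an embedding derived from the social network, and that attention is a softmax over a linear function of that embedding. The names `social_attention_predict`, `basis_weights`, and `attention_weights` are hypothetical.

```python
import numpy as np


def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - np.max(z, axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def social_attention_predict(text_features, author_embedding,
                             basis_weights, attention_weights):
    """Mix K basis classifiers using author-dependent attention.

    text_features:     (D,)      features of the text to classify
    author_embedding:  (E,)      assumed embedding of the author's network position
    basis_weights:     (K, C, D) one linear classifier per basis model (assumption)
    attention_weights: (K, E)    maps the author embedding to K attention scores
    Returns (class_probs of shape (C,), attention of shape (K,)).
    """
    # Attention over basis models depends only on the author, not on the text.
    attention = softmax(attention_weights @ author_embedding)
    # Each basis model produces its own class distribution for the text.
    basis_probs = softmax(basis_weights @ text_features, axis=1)  # (K, C)
    # The final prediction is the attention-weighted mixture.
    return attention @ basis_probs, attention


# Toy usage with random parameters (illustrative values only).
rng = np.random.default_rng(0)
K, C, D, E = 3, 2, 5, 4
probs, attn = social_attention_predict(
    rng.normal(size=D), rng.normal(size=E),
    rng.normal(size=(K, C, D)), rng.normal(size=(K, E)),
)
print(probs, attn)
```

Under these assumptions, authors whose embeddings are close receive similar attention vectors and therefore similar effective classifiers, which is one way to picture the smoothing across the network described above. Because attention depends only on the author's embedding, an author with no labeled examples still gets a personalized mixture as long as an embedding is available.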
