CL AIApr 10, 2023

Is ChatGPT a Good Sentiment Analyzer? A Preliminary Study

Zengzhi Wang, Qiming Xie, Yi Feng, Zixiang Ding, Zinong Yang, Rui Xia

arXiv:2304.04339v217.6212 citationsh-index: 22Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of assessing ChatGPT's utility for sentiment analysis tasks for researchers and practitioners, though it is incremental as a preliminary evaluation.

The study evaluated ChatGPT's performance as a sentiment analyzer across 7 tasks and 17 benchmark datasets, finding it competitive with fine-tuned BERT and SOTA models in some settings but with limitations in handling polarity shifts and open-domain scenarios.

Recently, ChatGPT has drawn great attention from both the research community and the public. We are particularly interested in whether it can serve as a universal sentiment analyzer. To this end, in this work, we provide a preliminary evaluation of ChatGPT on the understanding of \emph{opinions}, \emph{sentiments}, and \emph{emotions} contained in the text. Specifically, we evaluate it in three settings, including \emph{standard} evaluation, \emph{polarity shift} evaluation and \emph{open-domain} evaluation. We conduct an evaluation on 7 representative sentiment analysis tasks covering 17 benchmark datasets and compare ChatGPT with fine-tuned BERT and corresponding state-of-the-art (SOTA) models on them. We also attempt several popular prompting techniques to elicit the ability further. Moreover, we conduct human evaluation and present some qualitative case studies to gain a deep comprehension of its sentiment analysis capabilities.

View on arXiv PDF Code

Similar