CL LGAug 21, 2023

Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis

Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker, Sheak Rashed Haider Noori

U of Toronto

arXiv:2308.10783v218.788 citationsh-index: 38Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the problem of under-researched sentiment analysis for Bangla, a low-resource language, but is incremental as it applies existing methods to new data.

The study tackled sentiment analysis for the low-resource language Bangla by creating a dataset of 33,606 annotated tweets and comments, and found that monolingual transformer-based models outperformed other models, including in zero- and few-shot scenarios.

The rapid expansion of the digital world has propelled sentiment analysis into a critical tool across diverse sectors such as marketing, politics, customer service, and healthcare. While there have been significant advancements in sentiment analysis for widely spoken languages, low-resource languages, such as Bangla, remain largely under-researched due to resource constraints. Furthermore, the recent unprecedented performance of Large Language Models (LLMs) in various applications highlights the need to evaluate them in the context of low-resource languages. In this study, we present a sizeable manually annotated dataset encompassing 33,606 Bangla news tweets and Facebook comments. We also investigate zero- and few-shot in-context learning with several language models, including Flan-T5, GPT-4, and Bloomz, offering a comparative analysis against fine-tuned models. Our findings suggest that monolingual transformer-based models consistently outperform other models, even in zero and few-shot scenarios. To foster continued exploration, we intend to make this dataset and our research tools publicly available to the broader research community.

View on arXiv PDF Code

Similar