CLDec 24, 2024

Distilling Fine-grained Sentiment Understanding from Large Language Models

arXiv:2412.18552v21 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

It addresses the computational efficiency problem for NLP practitioners by enabling cost-effective sentiment analysis with incremental improvements.

This paper tackles the high inference cost of using large language models (LLMs) for fine-grained sentiment analysis (FSA) by distilling sentiment understanding from LLMs into small language models (SLMs), resulting in a 6.00% improvement in F1-score and enabling SLMs to match or exceed teacher models in zero-shot classification.

Fine-grained sentiment analysis (FSA) aims to extract and summarize user opinions from vast opinionated text. Recent studies demonstrate that large language models (LLMs) possess exceptional sentiment understanding capabilities. However, directly deploying LLMs for FSA applications incurs high inference costs. Therefore, this paper investigates the distillation of fine-grained sentiment understanding from LLMs into small language models (SLMs). We prompt LLMs to examine and interpret the sentiments of given reviews and then utilize the generated content to pretrain SLMs. Additionally, we develop a comprehensive FSA benchmark to evaluate both SLMs and LLMs. Extensive experiments on this benchmark reveal that: (1) distillation significantly enhances the performance of SLMs in FSA tasks, achieving a 6.00\% improvement in $F_1$-score, and the distilled model can outperform Llama-2-7b with only 220M parameters; (2) distillation equips SLMs with excellent zero-shot sentiment classification capabilities, enabling them to match or even exceed their teacher models. These results suggest that distillation from LLMs is a highly promising direction for FSA. We will release our code, data, and pretrained model weights at https://github.com/HITSZ-HLT/FSA-Distillation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes