CLAug 19, 2024

A Strategy to Combine 1stGen Transformers and Open LLMs for Automatic Text Classification

arXiv:2408.09629v16 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses cost-sensitive scenarios in NLP by offering a practical solution for sentiment analysis, though it is incremental in combining existing methods.

The study tackled the problem of high costs for fine-tuning large language models (LLMs) in text classification by proposing a confidence-based strategy that integrates first-generation transformers (1stTRs) with open LLMs, achieving performance competitive with fine-tuned LLMs at a fraction of the cost.

Transformer models have achieved state-of-the-art results, with Large Language Models (LLMs), an evolution of first-generation transformers (1stTR), being considered the cutting edge in several NLP tasks. However, the literature has yet to conclusively demonstrate that LLMs consistently outperform 1stTRs across all NLP tasks. This study compares three 1stTRs (BERT, RoBERTa, and BART) with two open LLMs (Llama 2 and Bloom) across 11 sentiment analysis datasets. The results indicate that open LLMs may moderately outperform or match 1stTRs in 8 out of 11 datasets but only when fine-tuned. Given this substantial cost for only moderate gains, the practical applicability of these models in cost-sensitive scenarios is questionable. In this context, a confidence-based strategy that seamlessly integrates 1stTRs with open LLMs based on prediction certainty is proposed. High-confidence documents are classified by the more cost-effective 1stTRs, while uncertain cases are handled by LLMs in zero-shot or few-shot modes, at a much lower cost than fine-tuned versions. Experiments in sentiment analysis demonstrate that our solution not only outperforms 1stTRs, zero-shot, and few-shot LLMs but also competes closely with fine-tuned LLMs at a fraction of the cost.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes