CLAILGJun 24, 2024

Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors

arXiv:2406.17163v128 citations
Originality Incremental advance
AI Analysis

This addresses critical misclassification and hallucination issues in intent classification for applications using large language models, representing an incremental improvement.

The paper tackles classification errors and out-of-vocabulary label generation in large language models for intent classification by introducing the Paraphrase and Aggregate (PAG)-LLM approach, which reduces errors by 22.7% on CLINC and 15.1% on Banking datasets.

Large language models (LLM) have achieved remarkable success in natural language generation but lesser focus has been given to their applicability in decision making tasks such as classification. We show that LLMs like LLaMa can achieve high performance on large multi-class classification tasks but still make classification errors and worse, generate out-of-vocabulary class labels. To address these critical issues, we introduce Paraphrase and AGgregate (PAG)-LLM approach wherein an LLM generates multiple paraphrases of the input query (parallel queries), performs multi-class classification for the original query and each paraphrase, and at the end aggregate all the classification labels based on their confidence scores. We evaluate PAG-LLM on two large multi-class classication datasets: CLINC, and Banking and show 22.7% and 15.1% error reduction. We show that PAG-LLM is especially effective for hard examples where LLM is uncertain, and reduces the critical misclassification and hallucinated label generation errors

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes