CL, AI, LG | Jun 8, 2024

CERET: Cost-Effective Extrinsic Refinement for Text Generation

arXiv:2406.05588v2 | 31 citations
AI Analysis

CERET addresses the scalability and cost issues of LLM refinement for text generation tasks, offering a more efficient alternative to existing methods.

The paper tackles the problem of improving the quality of text generated by Large Language Models (LLMs) without high computational cost. It proposes CERET, which outperforms baselines by ~1.6% in Rouge-1 for summarization and ~3.5% in hit rate for question answering, while requiring only 9.4% of the latency of a Self-rerank method.

Large Language Models (LLMs) are powerful models for generation tasks, but they may not generate good-quality outputs on their first attempt. Apart from model fine-tuning, existing approaches to improving prediction accuracy and quality typically involve LLM self-improvement or self-reflection, which incorporates feedback from the models themselves. Despite their effectiveness, these methods are hindered by high computational cost and a lack of scalability. In this work, we propose CERET, a method for refining text generations by considering semantic stability, entailment, and inter-sample uncertainty measures. Experimental results show that CERET consistently outperforms Self-consistency and Self-rerank baselines under various task setups, by ~1.6% in Rouge-1 for abstractive summarization and ~3.5% in hit rate for question answering. Compared to the LLM Self-rerank method, our approach requires only 9.4% of its latency and is more cost-effective.
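To make the idea of refining by reranking concrete, here is a minimal sketch, not the authors' implementation. It assumes that several candidate generations have already been sampled and that each one already carries three precomputed scores named after the measures in the abstract. The `Candidate` class, the `confidence` and `select_best` functions, the equal weights, the sign convention (uncertainty is subtracted), and all numeric values are hypothetical and chosen only for illustration. The abstract does not specify how the three measures are computed or combined.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One sampled generation with hypothetical, precomputed scores."""
    text: str
    stability: float    # assumed: higher means more semantically stable
    entailment: float   # assumed: higher means better entailed
    uncertainty: float  # assumed: higher means less reliable


def confidence(c, weights=(1.0, 1.0, 1.0)):
    """Combine the three scores linearly (an assumption, not the paper's formula)."""
    w_s, w_e, w_u = weights
    # Uncertainty is subtracted so that a higher result always means "better".
    return w_s * c.stability + w_e * c.entailment - w_u * c.uncertainty


def select_best(candidates, weights=(1.0, 1.0, 1.0)):
    """Return the candidate with the highest combined confidence."""
    if not candidates:
        raise ValueError("need at least one candidate")
    return max(candidates, key=lambda c: confidence(c, weights))


# Made-up scores, used only to exercise the selection logic.
candidates = [
    Candidate("draft A", stability=0.62, entailment=0.55, uncertainty=0.40),
    Candidate("draft B", stability=0.71, entailment=0.80, uncertainty=0.15),
    Candidate("draft C", stability=0.50, entailment=0.90, uncertainty=0.60),
]
best = select_best(candidates)
print(best.text)
```

Because selection in this sketch is a single pass over precomputed scores, it needs no additional LLM call per candidate. That is one plausible reading of why an extrinsic approach can be cheaper than asking the LLM itself to rerank its outputs.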

Code Implementations: 1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
