CL | AI | LG | Oct 24, 2024

Prompting and Fine-Tuning of Small LLMs for Length-Controllable Telephone Call Summarization

arXiv:2410.18624v1 | 5 citations | h-index: 11 | 2024 2nd International Conference on Foundation and Large Language Models (FLLM)
Originality: Incremental advance
AI Analysis

This work addresses the need for efficient and customizable call summarization systems, though it is incremental as it adapts existing methods to a specific domain.

The paper tackles telephone call summarization by fine-tuning a small LLM (Llama-2-7B) on a synthetic dataset so that summary length can be controlled, achieving performance on par with GPT-4 in factual accuracy, completeness, and conciseness.

This paper explores the rapid development of a telephone call summarization system using large language models (LLMs). Our approach begins with experiments on prompting existing LLMs to summarize telephone conversations, followed by the creation of a tailored synthetic training dataset using stronger frontier models. We place special focus on the diversity of the generated data and on the ability to control the length of the generated summaries to meet use-case-specific requirements. We evaluate the method with two state-of-the-art LLM-as-a-judge evaluation techniques to assess the quality and relevance of the summaries. Our results show that a fine-tuned Llama-2-7B-based summarization model performs on par with GPT-4 in terms of factual accuracy, completeness, and conciseness. Our findings demonstrate the potential for quickly bootstrapping a practical and efficient call summarization system.
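As a rough illustration of what length-controllable training data could look like, the sketch below pairs a transcript with a reference summary and derives the length limit in the prompt from the summary's own word count. This is a minimal sketch under assumptions, not the authors' pipeline: the prompt wording, the word-based limit, the rounding step of 10, and the names `round_up_to_step`, `build_prompt`, `make_training_example`, and `within_limit` are all hypothetical.

```python
def round_up_to_step(n_words: int, step: int = 10) -> int:
    """Round a word count up to the next multiple of `step` (at least one step)."""
    return max(step, ((n_words + step - 1) // step) * step)


def build_prompt(transcript: str, max_words: int) -> str:
    """Build an instruction prompt with an explicit length limit (hypothetical wording)."""
    return (
        f"Summarize the following telephone call in at most {max_words} words.\n\n"
        f"Transcript:\n{transcript}\n\nSummary:"
    )


def within_limit(summary: str, max_words: int) -> bool:
    """Check that a summary respects the requested word limit."""
    return len(summary.split()) <= max_words


def make_training_example(transcript: str, reference_summary: str) -> dict:
    """Create one fine-tuning record.

    Assumption: the reference summary comes from a stronger model, and the
    limit stated in the prompt is derived from its actual length, so the
    instruction and the target always agree.
    """
    max_words = round_up_to_step(len(reference_summary.split()))
    return {
        "prompt": build_prompt(transcript, max_words),
        "completion": reference_summary,
        "max_words": max_words,
    }


example = make_training_example(
    "Agent: Hello. Caller: Hi, my router is down.",
    "Caller reports a router outage.",
)
print(example["prompt"])
```

Deriving the limit from the reference summary keeps each training pair self-consistent; at inference time the same prompt template could be filled with whatever limit the use case requires.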

