CLAILGOct 7, 2025

DACP: Domain-Adaptive Continual Pre-Training of Large Language Models for Phone Conversation Summarization

arXiv:2510.05858v32 citationsh-index: 20Proceedings of The 5th New Frontiers in Summarization Workshop
Originality Incremental advance
AI Analysis

This addresses the challenge of improving summarization for noisy real-world conversations in industrial applications, offering a scalable alternative to costly fine-tuning, though it is incremental as it builds on existing continual pre-training methods.

The paper tackled the problem of adapting large language models for summarization in specialized domains like phone conversations, where performance often drops due to distribution shifts, and found that continual pre-training on unlabeled business conversation data yields substantial gains in both in-domain and out-of-domain benchmarks.

Large language models (LLMs) have achieved impressive performance in text summarization, yet their performance often falls short when applied to specialized domains that differ from their original pre-training distribution. While fine-tuning can improve summarization quality, it typically relies on costly and scarce high-quality labeled data. In this work, we explore continual pre-training as a scalable, self-supervised approach to adapt LLMs for downstream summarization tasks, particularly in the context of noisy real-world conversation transcripts. We conduct extensive experiments using large-scale, unlabeled business conversation data to investigate whether continual pre-training enhances model capabilities in conversational summarization. Our results demonstrate that continual pre-training yields substantial gains in both in-domain and out-of-domain summarization benchmarks, while maintaining strong generalization and robustness. We also analyze the effects of data selection strategies, providing practical guidelines for applying continual pre-training in summarization-focused industrial applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes