CLAIHCFeb 24, 2025

Bridging Information Gaps with Comprehensive Answers: Improving the Diversity and Informativeness of Follow-Up Questions

arXiv:2502.17715v24 citationsh-index: 10SEM
Originality Incremental advance
AI Analysis

This work addresses the problem of improving conversational agents for resource-constrained systems, though it is incremental as it builds on existing knowledge distillation and dataset augmentation techniques.

The paper tackled the challenge of generating diverse and informative follow-up questions for conversational agents using small models by developing an information-gap-driven knowledge distillation pipeline that augments a dataset tenfold and fine-tunes student models, resulting in significantly higher informativeness and diversity compared to baseline models.

Generating diverse follow-up questions that uncover missing information remains challenging for conversational agents, particularly when they run on small, locally hosted models. To address this, we develop an information-gap-driven knowledge distillation pipeline in which a teacher LLM generates a comprehensive answer, contrasts it with the initial answer to identify information gaps, and formulates gap-bridging follow-up questions. Using this pipeline, we augment the existing FollowupQG dataset tenfold. We then fine-tune smaller student models on the augmented dataset to distill the teacher's knowledge. Experiments with selected teacher-student model pairs show that fine-tuned students achieve significantly higher informativeness and diversity than variations trained on the original dataset. These findings indicate that our pipeline, which mirrors the human cognitive process of information seeking, provides an efficient distillation channel from state-of-the-art LLMs to smaller models, enabling resource-constrained conversational systems to generate more diverse and informative follow-up questions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes