CLAINov 17, 2023

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

CMU
arXiv:2311.10614v13 citationsh-index: 10
Originality Incremental advance
AI Analysis

This addresses the problem of LLMs struggling with knowledge-demanding queries in specific domains, offering a self-improvement pathway, though it is incremental as it builds on existing fine-tuning methods.

The paper tackles the challenge of enhancing large language models (LLMs) for domain-specific queries by introducing a two-step approach that mines question-answer pairs from documents and uses them to fine-tune a chatbot, resulting in performance improvements over general and domain-adapted models with minimal human intervention using only 600 seed instances.

Large Language Models (LLMs), despite their great power in language generation, often encounter challenges when dealing with intricate and knowledge-demanding queries in specific domains. This paper introduces a novel approach to enhance LLMs by effectively extracting the relevant knowledge from domain-specific textual sources, and the adaptive training of a chatbot with domain-specific inquiries. Our two-step approach starts from training a knowledge miner, namely LLMiner, which autonomously extracts Question-Answer pairs from relevant documents through a chain-of-thought reasoning process. Subsequently, we blend the mined QA pairs with a conversational dataset to fine-tune the LLM as a chatbot, thereby enriching its domain-specific expertise and conversational capabilities. We also developed a new evaluation benchmark which comprises four domain-specific text corpora and associated human-crafted QA pairs for testing. Our model shows remarkable performance improvement over generally aligned LLM and surpasses domain-adapted models directly fine-tuned on domain corpus. In particular, LLMiner achieves this with minimal human intervention, requiring only 600 seed instances, thereby providing a pathway towards self-improvement of LLMs through model-synthesized training data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes