CL AINov 17, 2023

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

Ruohong Zhang, Luyu Gao, Chen Zheng, Zhen Fan, Guokun Lai, Zheng Zhang, Fangzhou Ai, Yiming Yang, Hongxia Yang

CMU

arXiv:2311.10614v11.73 citationsh-index: 10

Originality Incremental advance

AI Analysis

This addresses the problem of LLMs struggling with knowledge-demanding queries in specific domains, offering a self-improvement pathway, though it is incremental as it builds on existing fine-tuning methods.

The paper tackles the challenge of enhancing large language models (LLMs) for domain-specific queries by introducing a two-step approach that mines question-answer pairs from documents and uses them to fine-tune a chatbot, resulting in performance improvements over general and domain-adapted models with minimal human intervention using only 600 seed instances.

Large Language Models (LLMs), despite their great power in language generation, often encounter challenges when dealing with intricate and knowledge-demanding queries in specific domains. This paper introduces a novel approach to enhance LLMs by effectively extracting the relevant knowledge from domain-specific textual sources, and the adaptive training of a chatbot with domain-specific inquiries. Our two-step approach starts from training a knowledge miner, namely LLMiner, which autonomously extracts Question-Answer pairs from relevant documents through a chain-of-thought reasoning process. Subsequently, we blend the mined QA pairs with a conversational dataset to fine-tune the LLM as a chatbot, thereby enriching its domain-specific expertise and conversational capabilities. We also developed a new evaluation benchmark which comprises four domain-specific text corpora and associated human-crafted QA pairs for testing. Our model shows remarkable performance improvement over generally aligned LLM and surpasses domain-adapted models directly fine-tuned on domain corpus. In particular, LLMiner achieves this with minimal human intervention, requiring only 600 seed instances, thereby providing a pathway towards self-improvement of LLMs through model-synthesized training data.

View on arXiv PDF

Similar