CLMar 23, 2024

General LLMs as Instructors for Domain-Specific LLMs: A Sequential Fusion Method to Integrate Extraction and Editing

Xin Zhang, Tianjie Ju, Huijia Liang, Ying Fu, Qin Zhang

arXiv:2403.15736v22.71 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses the challenge of efficiently updating LLMs for domain-specific applications with few-shot data, which is incremental but practically useful for fields like medicine and economics.

The paper tackles the problem of updating domain-specific LLMs with limited domain-expert reasoning data by introducing a Sequential Fusion method that uses general LLMs for relation extraction followed by knowledge editing, achieving accuracy gains of 39.1% to 45.0% in question-answering tasks across medical and economics-management domains.

The substantial interest in updating Large Language Models (LLMs) without retraining from scratch is accompanied by several challenges. This is particularly true when updating LLMs with datasets that necessitate domain-expert reasoning across extensive texts, despite limited samples. We termed the scenario as the Few-Shot Domain-Expert Reasoning for Updating LLMs (FDoR-UL). Traditional methods such as Low-Rank Adaptation (LoRA) and Retrieval Augmented Generation (RAG) are inadequate for addressing this critical issue, particularly evident in our exploration of a specific medical dataset that epitomizes the distinct needs of FDoR-UL. To tackle this challenge, we introduce a Sequential Fusion method to integrate knowledge from complex contexts into LLMs. This method employs a two-stage framework: initially leveraging general LLMs to perform relation extraction for knowledge acquisition from complex texts, followed by updating domain-specific LLMs through Knowledge Editing (KE). Employing our method, domain-specific LLMs achieved a 71.7% accuracy (an average gain of 39.1%) in question-answering tasks. Furthermore, we expanded our evaluation to a novel economics-management dataset we developed, where our method achieved a 75.0% accuracy (an average gain of 45.0%). These findings underscore the effectiveness and flexibility of our approach in FDoR-UL across various domains.

View on arXiv PDF

Similar