CLJul 24, 2024

Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications

Cui Long, Yongbin Liu, Chunping Ouyang, Ying Yu

arXiv:2407.21055v15.515 citationsh-index: 11Has Code

Originality Incremental advance

AI Analysis

This addresses the need for more reliable and effective AI in medical applications, though it appears incremental as it builds on existing RAG methods with domain-specific optimizations.

The study tackled the problem of open-source LLMs underperforming in medical applications due to knowledge gaps and hallucinations by introducing the Bailicai framework, which integrates retrieval-augmented generation with domain optimization, resulting in performance surpassing existing medical LLMs and GPT-3.5 on multiple benchmarks while reducing hallucinations and noise issues.

Large Language Models (LLMs) have exhibited remarkable proficiency in natural language understanding, prompting extensive exploration of their potential applications across diverse domains. In the medical domain, open-source LLMs have demonstrated moderate efficacy following domain-specific fine-tuning; however, they remain substantially inferior to proprietary models such as GPT-4 and GPT-3.5. These open-source models encounter limitations in the comprehensiveness of domain-specific knowledge and exhibit a propensity for 'hallucinations' during text generation. To mitigate these issues, researchers have implemented the Retrieval-Augmented Generation (RAG) approach, which augments LLMs with background information from external knowledge bases while preserving the model's internal parameters. However, document noise can adversely affect performance, and the application of RAG in the medical field remains in its nascent stages. This study presents the Bailicai framework: a novel integration of retrieval-augmented generation with large language models optimized for the medical domain. The Bailicai framework augments the performance of LLMs in medicine through the implementation of four sub-modules. Experimental results demonstrate that the Bailicai approach surpasses existing medical domain LLMs across multiple medical benchmarks and exceeds the performance of GPT-3.5. Furthermore, the Bailicai method effectively attenuates the prevalent issue of hallucinations in medical applications of LLMs and ameliorates the noise-related challenges associated with traditional RAG techniques when processing irrelevant or pseudo-relevant documents.

View on arXiv PDF

Similar