AI · Aug 8, 2025

Retrieval Augmented Large Language Model System for Comprehensive Drug Contraindications

arXiv:2508.06145v1 · 2 citations · h-index: 2 · Health Inf Sci Syst
Originality: Synthesis-oriented
AI Analysis

This work addresses the critical need for reliable drug contraindication data to reduce prescription errors, though it is an incremental application of existing RAG methods to a specific healthcare domain.

This study tackled the problem of inaccurate drug contraindication information from large language models in healthcare by implementing a Retrieval Augmented Generation pipeline, which improved accuracy from 0.49-0.57 to 0.87-0.94 across three categories.

The versatility of large language models (LLMs) has been explored across various sectors, but their application in healthcare poses challenges, particularly in the domain of pharmaceutical contraindications, where accurate and reliable information is required. This study enhances the capability of LLMs to address contraindications effectively by implementing a Retrieval Augmented Generation (RAG) pipeline. Using OpenAI's GPT-4o-mini as the base model and the text-embedding-3-small model for embeddings, our approach integrates LangChain to orchestrate a hybrid retrieval system with re-ranking. This system leverages Drug Utilization Review (DUR) data from public databases, focusing on contraindications for specific age groups, pregnancy, and concomitant drug use. The dataset includes 300 question-answer pairs across three categories, with baseline model accuracy ranging from 0.49 to 0.57. After integrating the RAG pipeline, we observed a significant improvement in model accuracy, achieving rates of 0.94, 0.87, and 0.89 for contraindications related to age groups, pregnancy, and concomitant drug use, respectively. The results indicate that augmenting LLMs with a RAG framework can substantially reduce uncertainty in prescription and drug intake decisions by providing more precise and reliable drug contraindication information.
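
The hybrid retrieval with re-ranking described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the retrievers are combined, so reciprocal rank fusion is assumed here, a term-count cosine similarity stands in for text-embedding-3-small, and a simple similarity sort stands in for the re-ranker. The documents, function names, and parameters are all hypothetical and use only the Python standard library.

```python
import math
import re
from collections import Counter

# Toy DUR-style records, invented for illustration (not from the paper's dataset).
DOCS = [
    "Drug A is contraindicated during pregnancy.",
    "Drug B is contraindicated in children under 12 years of age.",
    "Drug C must not be taken together with Drug D.",
]

def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())

def keyword_scores(query, docs):
    """Lexical retriever: count distinct query terms present in each document."""
    q = set(tokenize(query))
    return [len(q & set(tokenize(d))) for d in docs]

def cosine_scores(query, docs):
    """Stand-in for dense retrieval: cosine similarity of term-count vectors."""
    def cos(a, b):
        dot = sum(a[t] * b[t] for t in a)
        na = math.sqrt(sum(v * v for v in a.values()))
        nb = math.sqrt(sum(v * v for v in b.values()))
        return dot / (na * nb) if na and nb else 0.0
    qv = Counter(tokenize(query))
    return [cos(qv, Counter(tokenize(d))) for d in docs]

def ranks(scores):
    """Map document index to its 1-based rank (highest score gets rank 1)."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    return {doc: r for r, doc in enumerate(order, start=1)}

def hybrid_retrieve(query, docs, k=2, rrf_k=60):
    """Fuse both rankings with reciprocal rank fusion (an assumed fusion rule)."""
    kw = ranks(keyword_scores(query, docs))
    dn = ranks(cosine_scores(query, docs))
    fused = {i: 1 / (rrf_k + kw[i]) + 1 / (rrf_k + dn[i]) for i in range(len(docs))}
    top = sorted(fused, key=lambda i: -fused[i])[:k]
    return [docs[i] for i in top]

def rerank(query, candidates):
    """Placeholder re-ranker: reorder the candidates by similarity to the query."""
    scores = cosine_scores(query, candidates)
    paired = sorted(zip(scores, candidates), key=lambda p: -p[0])
    return [c for _, c in paired]

def build_prompt(query, docs):
    """Assemble the retrieved context and the question into one LLM prompt."""
    context = "\n".join(rerank(query, hybrid_retrieve(query, docs)))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

if __name__ == "__main__":
    print(build_prompt("Can Drug A be used during pregnancy?", DOCS))
```

In a real pipeline the prompt would then be sent to the generator model (GPT-4o-mini in the study), and the two scoring functions would be replaced by an actual keyword index and embedding search.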

