CLIR · Dec 3, 2025

AR-Med: Automated Relevance Enhancement in Medical Search via LLM-Driven Information Augmentation

arXiv:2512.03737v1 · h-index: 1
Originality: Incremental advance
AI Analysis

This work addresses the critical need for reliable medical search to ensure user safety and service efficacy on healthcare platforms. It represents a practical deployment rather than a fundamental breakthrough.

The paper tackles the problem of inaccurate medical search on online healthcare platforms by introducing AR-Med, a framework that uses LLMs with retrieval-augmentation and knowledge distillation, achieving over 93% offline accuracy and a 24% absolute improvement over the original online system.

Accurate and reliable search on online healthcare platforms is critical for user safety and service efficacy. Traditional methods, however, often fail to comprehend complex and nuanced user queries, limiting their effectiveness. Large language models (LLMs) present a promising solution, offering powerful semantic understanding to bridge this gap. Despite their potential, deploying LLMs in this high-stakes domain is fraught with challenges, including factual hallucinations, specialized knowledge gaps, and high operational costs. To overcome these barriers, we introduce AR-Med, a novel framework for Automated Relevance assessment for Medical search that has been successfully deployed at scale on the Online Medical Delivery Platforms. AR-Med grounds LLM reasoning in verified medical knowledge through a retrieval-augmented approach, ensuring high accuracy and reliability. To enable efficient online service, we design a practical knowledge distillation scheme that compresses large teacher models into compact yet powerful student models. We also introduce LocalQSMed, a multi-expert annotated benchmark developed to guide model iteration and ensure strong alignment between offline and online performance. Extensive experiments show AR-Med achieves an offline accuracy of over 93%, a 24% absolute improvement over the original online system, and delivers significant gains in online relevance and user satisfaction. Our work presents a practical and scalable blueprint for developing trustworthy, LLM-powered systems in real-world healthcare applications.

