CL LGFeb 16, 2025

RoseRAG: Robust Retrieval-augmented Generation with Small-scale LLMs via Margin-aware Preference Optimization

Tianci Liu, Haoxiang Jiang, Tianze Wang, Ran Xu, Yue Yu, Linjun Zhang, Tuo Zhao, Haoyu Wang

arXiv:2502.10993v115.514 citationsh-index: 11ACL

Originality Highly original

AI Analysis

This addresses the challenge of deploying efficient small-scale LLMs in resource-constrained settings by improving their reliability in retrieval-augmented generation, though it is incremental as it builds on existing RAG methods.

The paper tackles the problem of small-scale LLMs struggling with noisy retrieved knowledge in RAG by proposing RoseRAG, which uses margin-aware preference optimization to enhance robustness, achieving significant improvements over state-of-the-art baselines on open-domain QA benchmarks.

Large language models (LLMs) have achieved impressive performance but face high computational costs and latency, limiting their deployment in resource-constrained settings. In contrast, small-scale LLMs (SLMs) are more efficient yet struggle to capture evolving real-world knowledge. Retrieval-augmented generation (RAG) helps by integrating external knowledge, but imperfect retrieval can introduce distracting noise that misleads SLMs. We propose RoseRAG, a robust RAG framework for SLMs via Margin-aware Preference Optimization. RoseRAG employs multi-turn prompting for detailed reasoning, rejection sampling for high-quality explanations, and contrastive preference selection to refine responses by maximizing the likelihood gap between preferred and non-preferred outputs. By integrating these components into a margin-aware optimization process, RoseRAG robustly enhances the accuracy and reliability of SLMs for RAG applications. Extensive experiments on three open-domain question answering benchmarks indicate that our innovative RoseRAG surpasses state-of-the-art baselines significantly.

View on arXiv PDF

Similar