CL AI DB IRMay 28, 2025

Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems

Hoang Pham, Thuy-Duong Nguyen, Khac-Hoai Nam Bui

arXiv:2505.22571v34.91 citationsh-index: 16Has Code

Originality Incremental advance

AI Analysis

This addresses the need for more flexible and interpretable RAG systems for real-world question-answering applications, though it appears incremental as it builds on existing LLM agent concepts.

The paper tackles the problem of retrieval-augmented generation (RAG) systems being limited to either single-hop or multi-hop queries separately, proposing Agent-UniRAG, a trainable LLM agent framework that handles both query types in an end-to-end manner, achieving comparable performance to closed-source and larger open-source LLMs on various benchmarks.

This paper presents a novel approach for unified retrieval-augmented generation (RAG) systems using the recent emerging large language model (LLM) agent concept. Specifically, Agent LLM, which utilizes LLM as fundamental controllers, has become a promising approach to enable the interpretability of RAG tasks, especially for complex reasoning question-answering systems (e.g., multi-hop queries). Nonetheless, previous works mainly focus on solving RAG systems with either single-hop or multi-hop approaches separately, which limits the application of those approaches to real-world applications. In this study, we propose a trainable agent framework called Agent-UniRAG for unified retrieval-augmented LLM systems, which enhances the effectiveness and interpretability of RAG systems. The main idea is to design an LLM agent framework to solve RAG tasks step-by-step based on the complexity of the inputs, simultaneously including single-hop and multi-hop queries in an end-to-end manner. Furthermore, we introduce SynAgent-RAG, a synthetic dataset to enable the proposed agent framework for small open-source LLMs (e.g., Llama-3-8B). The results show comparable performances with closed-source and larger open-source LLMs across various RAG benchmarks. Our source code and dataset are publicly available for further exploitation.

View on arXiv PDF

Similar