CLAIMar 31, 2025

Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement

arXiv:2503.23895v423 citationsh-index: 30Has Code
Originality Incremental advance
AI Analysis

This addresses practical adoption challenges in retrieval-augmented generation for real-world applications, though it appears incremental as it builds on existing PRAG methods.

The paper tackles the problem of high inference, training, and storage costs in parametric retrieval-augmented generation (PRAG) for large language models by proposing Dynamic Parametric RAG (DyPRAG), which uses a lightweight parameter translator to efficiently convert documents into parametric knowledge, reducing costs and improving generalization while mitigating RAG hallucination.

Retrieval-augmented generation (RAG) enhances large language models (LLMs) by retrieving relevant documents from external sources and incorporating them into the context. While it improves reliability by providing factual texts, it significantly increases inference costs as context length grows and introduces challenging issue of RAG hallucination, primarily caused by the lack of corresponding parametric knowledge in LLMs. An efficient solution is to enhance the knowledge of LLMs at test-time. Parametric RAG (PRAG) addresses this by embedding document into LLMs parameters to perform test-time knowledge enhancement, effectively reducing inference costs through offline training. However, its high training and storage costs, along with limited generalization ability, significantly restrict its practical adoption. To address these challenges, we propose Dynamic Parametric RAG (DyPRAG), a novel framework that leverages a lightweight parameter translator model to efficiently convert documents into parametric knowledge. DyPRAG not only reduces inference, training, and storage costs but also dynamically generates parametric knowledge, seamlessly enhancing the knowledge of LLMs and resolving knowledge conflicts in a plug-and-play manner at test-time. Extensive experiments on multiple datasets demonstrate the effectiveness and generalization capabilities of DyPRAG, offering a powerful and practical RAG paradigm which enables superior knowledge fusion and mitigates RAG hallucination in real-world applications. Our code is available at https://github.com/Trae1ounG/DyPRAG.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes