CLApr 4, 2025

Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation

Weitao Li, Kaiming Liu, Xiangyu Zhang, Xuanyu Lei, Weizhi Ma, Yang Liu

arXiv:2504.03165v36.72 citationsh-index: 26Has CodeEMNLP

Originality Incremental advance

AI Analysis

This addresses retrieval noise and redundancy in RAG systems for knowledge-QA and hallucination detection, but appears incremental as it builds on existing RAG methods with clustering enhancements.

The paper tackles the problem of noise and redundancy in retrieved documents for Retrieval-Augmented Generation (RAG), which can cause errors in generation results, by proposing an Efficient Dynamic Clustering-based document Compression framework (EDC2-RAG) that improves performance across various scenarios and settings.

Retrieval-Augmented Generation (RAG) has emerged as a widely adopted approach for knowledge injection during large language model (LLM) inference in recent years. However, due to their limited ability to exploit fine-grained inter-document relationships, current RAG implementations face challenges in effectively addressing the retrieved noise and redundancy content, which may cause error in the generation results. To address these limitations, we propose an Efficient Dynamic Clustering-based document Compression framework (EDC2-RAG) that utilizes latent inter-document relationships while simultaneously removing irrelevant information and redundant content. We validate our approach, built upon GPT-3.5-Turbo and GPT-4o-mini, on widely used knowledge-QA and Hallucination-Detection datasets. Experimental results show that our method achieves consistent performance improvements across various scenarios and experimental settings, demonstrating strong robustness and applicability. Our code and datasets are available at https://github.com/Tsinghua-dhy/EDC-2-RAG.

View on arXiv PDF Code

Similar