AINov 13, 2024

Towards Optimizing a Retrieval Augmented Generation using Large Language Model on Academic Data

arXiv:2411.08438v17 citationsh-index: 10Has CodeNLPIR
Originality Synthesis-oriented
AI Analysis

This addresses improving RAG efficiency for academic organizations, but is incremental as it applies known optimizations to a specific domain.

The paper tackles optimizing Retrieval Augmented Generation (RAG) for academic data by testing four optimization techniques, including Multi-Query, and introduces a novel RAG Confusion Matrix for evaluation. Experiments show a significant performance increase with Multi-Query in retrieval.

Given the growing trend of many organizations integrating Retrieval Augmented Generation (RAG) into their operations, we assess RAG on domain-specific data and test state-of-the-art models across various optimization techniques. We incorporate four optimizations; Multi-Query, Child-Parent-Retriever, Ensemble Retriever, and In-Context-Learning, to enhance the functionality and performance in the academic domain. We focus on data retrieval, specifically targeting various study programs at a large technical university. We additionally introduce a novel evaluation approach, the RAG Confusion Matrix designed to assess the effectiveness of various configurations within the RAG framework. By exploring the integration of both open-source (e.g., Llama2, Mistral) and closed-source (GPT-3.5 and GPT-4) Large Language Models, we offer valuable insights into the application and optimization of RAG frameworks in domain-specific contexts. Our experiments show a significant performance increase when including multi-query in the retrieval phase.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes