IRCLNov 7, 2024

Deploying Large Language Models With Retrieval Augmented Generation

arXiv:2411.11895v111 citationsh-index: 2Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of grounding LLM outputs in factual data for information systems and behavioral research, but it is incremental as it focuses on field-testing and recommendations rather than novel methods.

The paper tackles the problem of large language models (LLMs) hallucinating by deploying Retrieval Augmented Generation (RAG) in a real-world pilot project, resulting in insights on best practices and an AI governance model for compliance.

Knowing that the generative capabilities of large language models (LLM) are sometimes hampered by tendencies to hallucinate or create non-factual responses, researchers have increasingly focused on methods to ground generated outputs in factual data. Retrieval Augmented Generation (RAG) has emerged as a key approach for integrating knowledge from data sources outside of the LLM's training set, including proprietary and up-to-date information. While many research papers explore various RAG strategies, their true efficacy is tested in real-world applications with actual data. The journey from conceiving an idea to actualizing it in the real world is a lengthy process. We present insights from the development and field-testing of a pilot project that integrates LLMs with RAG for information retrieval. Additionally, we examine the impacts on the information value chain, encompassing people, processes, and technology. Our aim is to identify the opportunities and challenges of implementing this emerging technology, particularly within the context of behavioral research in the information systems (IS) field. The contributions of this work include the development of best practices and recommendations for adopting this promising technology while ensuring compliance with industry regulations through a proposed AI governance model.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes