IRAIHCApr 11, 2025

RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR Environments

arXiv:2504.08256v23 citationsh-index: 12025 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW)
Originality Incremental advance
AI Analysis

This addresses the challenge of personalized and localized context understanding in VR for users, though it is incremental as it adapts existing RAG methods to a new domain.

The paper tackles the problem of improving question-answering accuracy and latency in VR environments by introducing RAG-VR, a system that uses retrieval-augmented generation with localized knowledge, resulting in accuracy improvements of 17.9%-41.8% and latency reductions of 34.5%-47.3% compared to baselines.

Recent advances in large language models (LLMs) provide new opportunities for context understanding in virtual reality (VR). However, VR contexts are often highly localized and personalized, limiting the effectiveness of general-purpose LLMs. To address this challenge, we present RAG-VR, the first 3D question-answering system for VR that incorporates retrieval-augmented generation (RAG), which augments an LLM with external knowledge retrieved from a localized knowledge database to improve the answer quality. RAG-VR includes a pipeline for extracting comprehensive knowledge about virtual environments and user conditions for accurate answer generation. To ensure efficient retrieval, RAG-VR offloads the retrieval process to a nearby edge server and uses only essential information during retrieval. Moreover, we train the retriever to effectively distinguish among relevant, irrelevant, and hard-to-differentiate information in relation to questions. RAG-VR improves answer accuracy by 17.9%-41.8% and reduces end-to-end latency by 34.5%-47.3% compared with two baseline systems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes