CVAIMay 9

RAG-HAR: Retrieval Augmented Generation-based Human Activity Recognition

arXiv:2512.0898444.42 citationsh-index: 26
Predicted impact top 75% in CV · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the need for efficient, generalizable HAR systems that avoid dataset-specific training and large labeled corpora, benefiting applications in healthcare and smart environments.

RAG-HAR introduces a training-free retrieval-augmented framework using LLMs for human activity recognition, achieving state-of-the-art performance across six benchmarks without model training or fine-tuning.

Human Activity Recognition (HAR) underpins applications in healthcare, rehabilitation, fitness tracking, and smart environments, yet existing deep learning approaches demand dataset-specific training, large labeled corpora, and significant computational resources.We introduce RAG-HAR, a training-free retrieval-augmented framework that leverages large language models (LLMs) for HAR. RAG-HAR computes lightweight statistical descriptors, retrieves semantically similar samples from a vector database, and uses this contextual evidence to make LLM-based activity identification. We further enhance RAG-HAR by first applying prompt optimization and introducing an LLM-based activity descriptor that generates context-enriched vector databases for delivering accurate and highly relevant contextual information. Along with these mechanisms, RAG-HAR achieves state-of-the-art performance across six diverse HAR benchmarks. Most importantly, RAG-HAR attains these improvements without requiring model training or fine-tuning, emphasizing its robustness and practical applicability. RAG-HAR moves beyond known behaviors, enabling the recognition and meaningful labelling of multiple unseen human activities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes