AICLMar 18

Retrieval-Augmented LLM Agents: Learning to Learn from Experience

arXiv:2603.1827296.51 citationsh-index: 26
AI Analysis

This work addresses the problem of building scalable and effective agents that learn from experience for AI researchers and practitioners, representing an incremental improvement over existing methods.

The paper tackles the challenge of robust generalization to unseen tasks in LLM-based agents by combining fine-tuning and experience retrieval, resulting in a pipeline that significantly improves generalization.

While large language models (LLMs) have advanced the development of general-purpose agents, achieving robust generalization to unseen tasks remains a significant challenge. Current approaches typically rely on either fine-tuning or training-free memory-augmented generation using retrieved experience; yet both have limitations: fine-tuning often fails to extrapolate to new tasks, while experience retrieval often underperforms compared to supervised baselines. In this work, we propose to combine these approaches and systematically study how to train retrieval-augmented LLM agents to effectively leverage retrieved trajectories in-context. First, we establish a robust supervised fine-tuning (SFT) recipe using LoRA that outperforms several state-of-the-art agent training pipelines. Second, we provide a detailed analysis of key design choices for experience retrieval, identifying optimal strategies for storage, querying, and trajectory selection. Finally, we propose a pipeline that integrates experience retrieval into the fine-tuning process. Our results demonstrate that this combined approach significantly improves generalization to unseen tasks, providing a scalable and effective framework for building agents that learn to learn from experience.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes