AIMay 10

From Passive Reuse to Active Reasoning: Grounding Large Language Models for Neuro-Symbolic Experience Replay

arXiv:2605.0941969.9
Predicted impact top 51% in AI · last 90 daysOriginality Incremental advance
AI Analysis

For reinforcement learning, NSER bridges the gap between linguistic reasoning and numerical optimization to improve sample efficiency, but it is an incremental hybrid approach.

NSER transforms experience replay in RL from passive sample reuse to active knowledge construction by using LLMs to induce behavioral rules, grounding them into differentiable logic, and dynamically reweighting replay distribution, achieving consistent superior sample efficiency and convergence speed across benchmarks.

While experience replay is essential for data efficiency in reinforcement learning (RL), standard methods treat the replay buffer as a passive memory system, prioritizing samples based on numerical prediction errors rather than their semantic significance. This approach stands in contrast to human learning, which accelerates mastery by actively abstracting fragmented experiences into behavioral rules. To bridge this gap, we propose Neuro-Symbolic Experience Replay (NSER), a framework that transforms experience replay from a passive sample reuse mechanism into an active engine for knowledge construction. Specifically, NSER addresses the incompatibility between linguistic reasoning and numerical optimization through a novel neuro-symbolic grounding pipeline. It leverages Large Language Models (LLMs) in a zero-shot manner to induce candidate behavioral rules from accumulated trajectories, grounds these insights into differentiable first-order logic representations, and utilizes the resulting symbolic structures to dynamically reweight the replay distribution. By allowing abstract knowledge to directly shape policy optimization, NSER achieves consistent superior sample efficiency and convergence speed across reactive, rule-based, and procedural benchmarks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes