NEOct 12, 2012

Autonomous Reinforcement of Behavioral Sequences in Neural Dynamics

arXiv:1210.3569v213 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of implementing reinforcement learning in neural systems without losing dynamic properties, which is incremental as it adapts existing methods to a neuroscience context.

The paper tackles the problem of learning behavioral sequences from delayed reward in neural dynamics, introducing DN-SARSA(λ) which combines Dynamic Field Theory, reinforcement learning, and a working memory model, and shows it performs comparably to discrete SARSA(λ) in simulated and real robot experiments.

We introduce a dynamic neural algorithm called Dynamic Neural (DN) SARSA(λ) for learning a behavioral sequence from delayed reward. DN-SARSA(λ) combines Dynamic Field Theory models of behavioral sequence representation, classical reinforcement learning, and a computational neuroscience model of working memory, called Item and Order working memory, which serves as an eligibility trace. DN-SARSA(λ) is implemented on both a simulated and real robot that must learn a specific rewarding sequence of elementary behaviors from exploration. Results show DN-SARSA(λ) performs on the level of the discrete SARSA(λ), validating the feasibility of general reinforcement learning without compromising neural dynamics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes