Vladimir Marochko

2papers

2 Papers

AIDec 20, 2017
Pseudorehearsal in actor-critic agents with neural network function approximation

Vladimir Marochko, Leonard Johard, Manuel Mazzara et al.

Catastrophic forgetting has a significant negative impact in reinforcement learning. The purpose of this study is to investigate how pseudorehearsal can change performance of an actor-critic agent with neural-network function approximation. We tested agent in a pole balancing task and compared different pseudorehearsal approaches. We have found that pseudorehearsal can assist learning and decrease forgetting.

AIMar 21, 2017
Pseudorehearsal in value function approximation

Vladimir Marochko, Leonard Johard, Manuel Mazzara

Catastrophic forgetting is of special importance in reinforcement learning, as the data distribution is generally non-stationary over time. We study and compare several pseudorehearsal approaches for Q-learning with function approximation in a pole balancing task. We have found that pseudorehearsal seems to assist learning even in such very simple problems, given proper initialization of the rehearsal parameters.