LGMLDec 5, 2019

Hindsight Credit Assignment

arXiv:1912.02503v191 citations
Originality Incremental advance
AI Analysis

This addresses credit assignment challenges in reinforcement learning, but appears incremental as it builds on existing value function concepts.

The paper tackles the problem of efficient credit assignment in reinforcement learning by proposing a method that assigns credit to past decisions based on the likelihood they led to observed outcomes, using hindsight instead of foresight, and shows that this yields a new family of algorithms that successfully address credit assignment challenges in illustrative tasks.

We consider the problem of efficient credit assignment in reinforcement learning. In order to efficiently and meaningfully utilize new data, we propose to explicitly assign credit to past decisions based on the likelihood of them having led to the observed outcome. This approach uses new information in hindsight, rather than employing foresight. Somewhat surprisingly, we show that value functions can be rewritten through this lens, yielding a new family of algorithms. We study the properties of these algorithms, and empirically show that they successfully address important credit assignment challenges, through a set of illustrative tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes