LGMLJun 4

Adaptive state-action abstractions via rate-distortion

arXiv:2606.0612314.1
Predicted impact top 57% in LG · last 90 daysOriginality Incremental advance
AI Analysis

It addresses the problem of automatically selecting abstraction granularity during learning for reinforcement learning agents, but the results are limited to tabular settings and the approach is incremental.

The paper proposes a principle for dynamically adjusting the granularity of state-action abstractions in reinforcement learning, based on a trade-off between learning error and abstraction error. In tabular settings, the method achieves near-optimal performance under substantial lossy compression.

When learning to walk, infants seem to address a coarse version of the problem first - stay upright, reach the caregiver - and refine it only when further practice at that resolution stops paying off. Reinforcement learning offers multiple techniques for building simple versions of complex tasks, but lacks general principles for how to dynamically adjust the granularity of these abstractions during learning. This paper proposes one such principle: refine the abstraction as soon as the learning error within it becomes comparable to the error induced by the abstraction itself. Here, we investigate one way of formalising this principle via a performance certificate that decomposes value error into two terms: a learning error bound captured by a Bellman residual, and an abstraction error bound given by a bisimulation metric. The resulting switching strategy is implemented by soft state-action abstractions built from rate-distortion principles, whose resolution along state and action axes can be continuously adjusted. We validate this construction in a range of tabular settings, showing that near-optimal performance can be achieved under substantial lossy compression of state and action information.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes