LGMLJul 23, 2012

Bellman Error Based Feature Generation using Random Projections on Sparse Spaces

arXiv:1207.5554v32 citations
Originality Incremental advance
AI Analysis

This work addresses feature generation in reinforcement learning, offering a robust method for policy evaluation, though it appears incremental as it builds on existing BEBF concepts.

The paper tackles the problem of automatic feature generation for value function approximation by proposing a fast algorithm using random projections to generate Bellman Error Basis Functions (BEBFs) for sparse spaces, with theoretical guarantees and empirical validation.

We address the problem of automatic generation of features for value function approximation. Bellman Error Basis Functions (BEBFs) have been shown to improve the error of policy evaluation with function approximation, with a convergence rate similar to that of value iteration. We propose a simple, fast and robust algorithm based on random projections to generate BEBFs for sparse feature spaces. We provide a finite sample analysis of the proposed method, and prove that projections logarithmic in the dimension of the original space are enough to guarantee contraction in the error. Empirical results demonstrate the strength of this method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes