ML LGApr 7, 2022

Q-learning with online random forests

arXiv:2204.03771v16 citationsh-index: 17

Originality Incremental advance

AI Analysis

This work addresses a domain-specific problem in reinforcement learning for tasks that do not require strong domain representation, offering an incremental improvement over existing methods.

The authors tackled the problem of approximating the Q-function in Q-learning by using online random forests, specifically expanding forests that grow during learning, and demonstrated improved performance over Deep Q-Networks in two OpenAI gym environments (blackjack and inverted pendulum) but not in lunar lander.

$Q$-learning is the most fundamental model-free reinforcement learning algorithm. Deployment of $Q$-learning requires approximation of the state-action value function (also known as the $Q$-function). In this work, we provide online random forests as $Q$-function approximators and propose a novel method wherein the random forest is grown as learning proceeds (through expanding forests). We demonstrate improved performance of our methods over state-of-the-art Deep $Q$-Networks in two OpenAI gyms (`blackjack' and `inverted pendulum') but not in the `lunar lander' gym. We suspect that the resilience to overfitting enjoyed by random forests recommends our method for common tasks that do not require a strong representation of the problem domain. We show that expanding forests (in which the number of trees increases as data comes in) improve performance, suggesting that expanding forests are viable for other applications of online random forests beyond the reinforcement learning setting.

View on arXiv PDF

Similar