LG AI GNNov 2, 2020

Causal Campbell-Goodhart's law and Reinforcement Learning

arXiv:2011.01010v25.86 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This highlights a potential pitfall in applying RL to complex real-world problems, emphasizing the need for causal understanding to avoid incremental errors.

The paper demonstrates that off-the-shelf deep reinforcement learning algorithms can fall prey to Campbell-Goodhart's law, a causal inference error where agents target correlated but non-causal variables, leading to policy errors similar to human mistakes.

Campbell-Goodhart's law relates to the causal inference error whereby decision-making agents aim to influence variables which are correlated to their goal objective but do not reliably cause it. This is a well known error in Economics and Political Science but not widely labelled in Artificial Intelligence research. Through a simple example, we show how off-the-shelf deep Reinforcement Learning (RL) algorithms are not necessarily immune to this cognitive error. The off-policy learning method is tricked, whilst the on-policy method is not. The practical implication is that naive application of RL to complex real life problems can result in the same types of policy errors that humans make. Great care should be taken around understanding the causal model that underpins a solution derived from Reinforcement Learning.

View on arXiv PDF Code

Similar