AINCJun 24, 2021

Modeling the Mistakes of Boundedly Rational Agents Within a Bayesian Theory of Mind

arXiv:2106.13249v127 citations
Originality Incremental advance
AI Analysis

This addresses the problem of improving AI's ability to understand human behavior for applications like teaching and assistance, though it is incremental by extending an existing framework.

The paper tackled the problem that Bayesian models of theory of mind fail to account for human mistakes, such as locking oneself out, by extending the framework to model boundedly rational agents with errors in goals, plans, and actions. The model better explained human goal inferences in gridworld and block-stacking experiments, generalizing across domains.

When inferring the goals that others are trying to achieve, people intuitively understand that others might make mistakes along the way. This is crucial for activities such as teaching, offering assistance, and deciding between blame or forgiveness. However, Bayesian models of theory of mind have generally not accounted for these mistakes, instead modeling agents as mostly optimal in achieving their goals. As a result, they are unable to explain phenomena like locking oneself out of one's house, or losing a game of chess. Here, we extend the Bayesian Theory of Mind framework to model boundedly rational agents who may have mistaken goals, plans, and actions. We formalize this by modeling agents as probabilistic programs, where goals may be confused with semantically similar states, plans may be misguided due to resource-bounded planning, and actions may be unintended due to execution errors. We present experiments eliciting human goal inferences in two domains: (i) a gridworld puzzle with gems locked behind doors, and (ii) a block-stacking domain. Our model better explains human inferences than alternatives, while generalizing across domains. These findings indicate the importance of modeling others as bounded agents, in order to account for the full richness of human intuitive psychology.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes