AIMAJan 18, 2019

Theory of Minds: Understanding Behavior in Groups Through Inverse Planning

arXiv:1901.06085v197 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of building machine-learning algorithms with human-like social intelligence for applications in cooperation and competition, representing an incremental advance in multi-agent inference methods.

The paper tackled the problem of inferring latent social relationships from sparse observations of multi-agent behavior by developing a generative model based on Composable Team Hierarchies (CTH), which recovers underlying causal models from few observations and aligns closely with human judgments.

Human social behavior is structured by relationships. We form teams, groups, tribes, and alliances at all scales of human life. These structures guide multi-agent cooperation and competition, but when we observe others these underlying relationships are typically unobservable and hence must be inferred. Humans make these inferences intuitively and flexibly, often making rapid generalizations about the latent relationships that underlie behavior from just sparse and noisy observations. Rapid and accurate inferences are important for determining who to cooperate with, who to compete with, and how to cooperate in order to compete. Towards the goal of building machine-learning algorithms with human-like social intelligence, we develop a generative model of multi-agent action understanding based on a novel representation for these latent relationships called Composable Team Hierarchies (CTH). This representation is grounded in the formalism of stochastic games and multi-agent reinforcement learning. We use CTH as a target for Bayesian inference yielding a new algorithm for understanding behavior in groups that can both infer hidden relationships as well as predict future actions for multiple agents interacting together. Our algorithm rapidly recovers an underlying causal model of how agents relate in spatial stochastic games from just a few observations. The patterns of inference made by this algorithm closely correspond with human judgments and the algorithm makes the same rapid generalizations that people do.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes