AIJun 30, 2018

Modeling Friends and Foes

arXiv:1807.00196v13 citations
Originality Incremental advance
AI Analysis

This addresses the need for safe and robust agents to understand environmental attitudes, though it appears incremental as it builds on existing game theory concepts.

The paper tackles the problem of detecting friendly and adversarial behavior from raw data by proposing a definition based on the environment's reaction to an agent's private strategy, and it shows experimentally that both types of environments have non-trivial optimal strategies.

How can one detect friendly and adversarial behavior from raw data? Detecting whether an environment is a friend, a foe, or anything in between, remains a poorly understood yet desirable ability for safe and robust agents. This paper proposes a definition of these environmental "attitudes" based on an characterization of the environment's ability to react to the agent's private strategy. We define an objective function for a one-shot game that allows deriving the environment's probability distribution under friendly and adversarial assumptions alongside the agent's optimal strategy. Furthermore, we present an algorithm to compute these equilibrium strategies, and show experimentally that both friendly and adversarial environments possess non-trivial optimal strategies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes