AIFeb 20, 2024

Learning and Sustaining Shared Normative Systems via Bayesian Rule Induction in Markov Games

arXiv:2402.13399v214 citationsh-index: 3AAMAS
AI Analysis

This addresses the challenge of building AI agents that can cooperate with human institutions, though it appears incremental in applying Bayesian methods to norm induction in multi-agent settings.

The paper tackles the problem of enabling learning agents to adopt and sustain shared normative systems for flexible cooperation, demonstrating that agents can rapidly learn and sustain cooperative institutions like resource management norms and compensation for pro-social labor, promoting collective welfare.

A universal feature of human societies is the adoption of systems of rules and norms in the service of cooperative ends. How can we build learning agents that do the same, so that they may flexibly cooperate with the human institutions they are embedded in? We hypothesize that agents can achieve this by assuming there exists a shared set of norms that most others comply with while pursuing their individual desires, even if they do not know the exact content of those norms. By assuming shared norms, a newly introduced agent can infer the norms of an existing population from observations of compliance and violation. Furthermore, groups of agents can converge to a shared set of norms, even if they initially diverge in their beliefs about what the norms are. This in turn enables the stability of the normative system: since agents can bootstrap common knowledge of the norms, this leads the norms to be widely adhered to, enabling new entrants to rapidly learn those norms. We formalize this framework in the context of Markov games and demonstrate its operation in a multi-agent environment via approximately Bayesian rule induction of obligative and prohibitive norms. Using our approach, agents are able to rapidly learn and sustain a variety of cooperative institutions, including resource management norms and compensation for pro-social labor, promoting collective welfare while still allowing agents to act in their own interests.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes