AI GTJan 25

DIML: Differentiable Inverse Mechanism Learning from Behaviors of Multi-Agent Learning Trajectories

arXiv:2601.17678v1Has Code

Originality Incremental advance

AI Analysis

This addresses the challenge of inferring unstructured mechanisms in multi-agent systems, offering a novel approach for observational settings, though it is incremental as it builds on existing inverse learning and mechanism design concepts.

The paper tackles the problem of recovering unknown incentive-generating mechanisms from observed strategic interactions of self-interested learning agents, proposing DIML, a likelihood-based framework that reliably recovers identifiable incentive differences and scales to large environments with hundreds of participants.

We study inverse mechanism learning: recovering an unknown incentive-generating mechanism from observed strategic interaction traces of self-interested learning agents. Unlike inverse game theory and multi-agent inverse reinforcement learning, which typically infer utility/reward parameters inside a structured mechanism, our target includes unstructured mechanism -- a (possibly neural) mapping from joint actions to per-agent payoffs. Unlike differentiable mechanism design, which optimizes mechanisms forward, we infer mechanisms from behavior in an observational setting. We propose DIML, a likelihood-based framework that differentiates through a model of multi-agent learning dynamics and uses the candidate mechanism to generate counterfactual payoffs needed to predict observed actions. We establish identifiability of payoff differences under a conditional logit response model and prove statistical consistency of maximum likelihood estimation under standard regularity conditions. We evaluate DIML with simulated interactions of learning agents across unstructured neural mechanisms, congestion tolling, public goods subsidies, and large-scale anonymous games. DIML reliably recovers identifiable incentive differences and supports counterfactual prediction, where its performance rivals tabular enumeration oracle in small environments and its convergence scales to large, hundred-participant environments. Code to reproduce our experiments is open-sourced.

View on arXiv PDF

Similar