GTMay 22

Anytime Detection of Strategic Deviations in Multi-Agent Systems

arXiv:2601.0542790.03 citationsh-index: 7
AI Analysis

It provides a statistically rigorous, online method for detecting behavioral drift in multi-agent systems, addressing a practical need for real-time monitoring in repeated interactions.

The paper introduces a sequential testing framework for detecting strategic deviations in multi-agent systems, using e-values to construct test supermartingales that monitor departures from equilibrium in normal-form and stochastic games, with finite-time guarantees and false discovery rate control.

In many multi-agent systems, agents interact repeatedly and are expected to settle into stable, rational behavior over time. Yet in practice, behavior often drifts, and detecting such deviations in real time remains an open challenge. We introduce a sequential testing framework that monitors whether observed play is consistent with a benchmark of strategic behavior, without assuming a fixed sample size. Our approach builds on the e-value framework for safe anytime-valid inference: by "betting" against the benchmark, we construct a test supermartingale that accumulates evidence whenever observed payoffs systematically violate the expected conditions. For repeated normal-form games, we take equilibrium as the benchmark, yielding a statistically sound, interpretable measure of departure from equilibrium that can be monitored online; our framework unifies the treatment of Nash, correlated, and coarse correlated equilibria, offering finite-time guarantees and a detailed analysis of detection times. We also leverage Benjamini-Hochberg-type procedures to increase detection power in large games while rigorously controlling the false discovery rate. Finally, we extend our method to stochastic games, verifying online whether observed trajectories adhere to a specified target policy, such as a computed equilibrium, broadening the framework's applicability to dynamic, state-dependent settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes