Tree Search for Simultaneous Move Games via Equilibrium Approximation
This addresses performance gaps in partial information games like Google Research Football and Starcraft, offering a practical method for competitive, cooperative, and mixed tasks, though it is incremental as it adapts existing algorithms.
The paper tackled adapting tree search algorithms from perfect information to simultaneous-move games by approximating a coarse correlated equilibrium, achieving better performance than current best MARL algorithms on various baseline environments.
Neural network supported tree-search has shown strong results in a variety of perfect information multi-agent tasks. However, the performance of these methods on partial information games has generally been below competing approaches. Here we study the class of simultaneous-move games, which are a subclass of partial information games which are most similar to perfect information games: both agents know the game state with the exception of the opponent's move, which is revealed only after each agent makes its own move. Simultaneous move games include popular benchmarks such as Google Research Football and Starcraft. In this study we answer the question: can we take tree search algorithms trained through self-play from perfect information settings and adapt them to simultaneous move games without significant loss of performance? We answer this question by deriving a practical method that attempts to approximate a coarse correlated equilibrium as a subroutine within a tree search. Our algorithm works on cooperative, competitive, and mixed tasks. Our results are better than the current best MARL algorithms on a wide range of accepted baseline environments.