LGFeb 19, 2025

Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks

arXiv:2502.13918v1h-index: 4
Originality Incremental advance
AI Analysis

This addresses the problem of AI decision-making in complex military simulations for researchers and gamers, but it is incremental as it builds on existing methods like AlphaZero.

The paper tackled the strategic complexity of Hex and Counter Wargames by integrating recurrent neural networks with AlphaZero, showing promising results in generalization across terrain and tactics with minimal training.

Hex and Counter Wargames are adversarial two-player simulations of real military conflicts requiring complex strategic decision-making. Unlike classical board games, these games feature intricate terrain/unit interactions, unit stacking, large maps of varying sizes, and simultaneous move and combat decisions involving hundreds of units. This paper introduces a novel system designed to address the strategic complexity of Hex and Counter Wargames by integrating cutting-edge advancements in Recurrent Neural Networks with AlphaZero, a reliable modern Reinforcement Learning algorithm. The system utilizes a new Neural Network architecture developed from existing research, incorporating innovative state and action representations tailored to these specific game environments. With minimal training, our solution has shown promising results in typical scenarios, demonstrating the ability to generalize across different terrain and tactical situations. Additionally, we explore the system's potential to scale to larger map sizes. The developed system is openly accessible, facilitating continued research and exploration within this challenging domain.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes