LGMay 1, 2025

Policies of Multiple Skill Levels for Better Strength Estimation in Games

Kyota Kuboki, Tatsuyoshi Ogawa, Chu-Hsuan Hsueh, Shi-Jim Yen, Kokolo Ikeda

arXiv:2505.00279v1h-index: 12Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment

Originality Incremental advance

AI Analysis

This work improves human-AI interaction in games by providing more precise skill estimation, though it is incremental as it builds on existing strength estimation methods.

The paper tackled the problem of accurately estimating human skill levels in games by incorporating policies for different skill levels, achieving improvements of 8-9% in accuracy over previous methods, with up to 92% accuracy in Go given 20 matches.

Accurately estimating human skill levels is crucial for designing effective human-AI interactions so that AI can provide appropriate challenges or guidance. In games where AI players have beaten top human professionals, strength estimation plays a key role in adapting AI behavior to match human skill levels. In a previous state-of-the-art study, researchers have proposed a strength estimator trained using human players' match data. Given some matches, the strength estimator computes strength scores and uses them to estimate player ranks (skill levels). In this paper, we focus on the observation that human players' behavior tendency varies according to their strength and aim to improve the accuracy of strength estimation by taking this into account. Specifically, in addition to strength scores, we obtain policies for different skill levels from neural networks trained using human players' match data. We then combine features based on these policies with the strength scores to estimate strength. We conducted experiments on Go and chess. For Go, our method achieved an accuracy of 80% in strength estimation when given 10 matches, which increased to 92% when given 20 matches. In comparison, the previous state-of-the-art method had an accuracy of 71% with 10 matches and 84% with 20 matches, demonstrating improvements of 8-9%. We observed similar improvements in chess. These results contribute to developing a more accurate strength estimation method and to improving human-AI interaction.

View on arXiv PDF

Similar