LGJul 18, 2022

Word Play for Playing Othello (Reverses)

arXiv:2207.08766v11.81 citationsh-index: 13

Originality Synthesis-oriented

AI Analysis

This provides a novel method for generating game archives to compare strategies, though it is incremental as it extends prior work on chess and Go to Othello.

The research applied GPT-2 and GPT-3 language models to generate Othello games, achieving 13-71% completion with GPT-2 and 41% with GPT-3, and doubled the available game archive size to 240,000 games.

Language models like OpenAI's Generative Pre-Trained Transformers (GPT-2/3) capture the long-term correlations needed to generate text in a variety of domains (such as language translators) and recently in gameplay (chess, Go, and checkers). The present research applies both the larger (GPT-3) and smaller (GPT-2) language models to explore the complex strategies for the game of Othello (or Reverses). Given the game rules for rapid reversals of fortune, the language model not only represents a candidate predictor of the next move based on previous game moves but also avoids sparse rewards in gameplay. The language model automatically captures or emulates championship-level strategies. The fine-tuned GPT-2 model generates Othello games ranging from 13-71% completion, while the larger GPT-3 model reaches 41% of a complete game. Like previous work with chess and Go, these language models offer a novel way to generate plausible game archives, particularly for comparing opening moves across a larger sample than humanly possible to explore. A primary contribution of these models magnifies (by two-fold) the previous record for player archives (120,000 human games over 45 years from 1977-2022), thus supplying the research community with more diverse and original strategies for sampling with other reinforcement learning techniques.

View on arXiv PDF

Similar