From Gameplay Traces to Game Mechanics: Causal Induction with Large Language Models

Mohit Jiwatode, Alexander Dockhorn, Bodo Rosenhahn

arXiv:2602.00190v12.4h-index: 5

Originality Incremental advance

AI Analysis

This work addresses the challenge of interpretability in AI agents for game domains, offering a method to reverse-engineer rules for applications like causal reinforcement learning, though it is incremental as it builds on existing frameworks like GVGAI.

The paper tackled the problem of inferring causal game mechanics from gameplay traces using Large Language Models, finding that a two-stage method involving structural causal models produced more accurate Video Game Description Language rules than direct generation, achieving up to 81% preference win rates in evaluations.

Deep learning agents can achieve high performance in complex game domains without often understanding the underlying causal game mechanics. To address this, we investigate Causal Induction: the ability to infer governing laws from observational data, by tasking Large Language Models (LLMs) with reverse-engineering Video Game Description Language (VGDL) rules from gameplay traces. To reduce redundancy, we select nine representative games from the General Video Game AI (GVGAI) framework using semantic embeddings and clustering. We compare two approaches to VGDL generation: direct code generation from observations, and a two-stage method that first infers a structural causal model (SCM) and then translates it into VGDL. Both approaches are evaluated across multiple prompting strategies and controlled context regimes, varying the amount and form of information provided to the model, from just raw gameplay observations to partial VGDL specifications. Results show that the SCM-based approach more often produces VGDL descriptions closer to the ground truth than direct generation, achieving preference win rates of up to 81\% in blind evaluations and yielding fewer logically inconsistent rules. These learned SCMs can be used for downstream use cases such as causal reinforcement learning, interpretable agents, and procedurally generating novel but logically consistent games.

View on arXiv PDF

Similar