AIFeb 15, 2025

PCGRLLM: Large Language Model-Driven Reward Design for Procedural Content Generation Reinforcement Learning

arXiv:2502.10906v13 citationsh-index: 10
Originality Incremental advance
AI Analysis

This work addresses reducing human effort in game AI development for procedural content generation, but it is incremental as it builds on earlier methods.

The paper tackles the problem of reward design for game AI by introducing PCGRLLM, an extended architecture using LLMs with feedback and prompt engineering, resulting in performance improvements of 415% and 40% on a story-to-reward generation task.

Reward design plays a pivotal role in the training of game AIs, requiring substantial domain-specific knowledge and human effort. In recent years, several studies have explored reward generation for training game agents and controlling robots using large language models (LLMs). In the content generation literature, there has been early work on generating reward functions for reinforcement learning agent generators. This work introduces PCGRLLM, an extended architecture based on earlier work, which employs a feedback mechanism and several reasoning-based prompt engineering techniques. We evaluate the proposed method on a story-to-reward generation task in a two-dimensional environment using two state-of-the-art LLMs, demonstrating the generalizability of our approach. Our experiments provide insightful evaluations that demonstrate the capabilities of LLMs essential for content generation tasks. The results highlight significant performance improvements of 415% and 40% respectively, depending on the zero-shot capabilities of the language model. Our work demonstrates the potential to reduce human dependency in game AI development, while supporting and enhancing creative processes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes