AIAug 13, 2025

Human-Aligned Procedural Level Generation Reinforcement Learning via Text-Level-Sketch Shared Representation

arXiv:2508.09860v13 citationsh-index: 23
Originality Incremental advance
AI Analysis

This work addresses the need for more human-centered AI tools in collaborative content creation for game designers, though it appears incremental by extending control modalities in PCGRL.

The paper tackled the problem of human-aligned procedural content generation via reinforcement learning (PCGRL) by proposing VIPCGRL, a framework that incorporates text, level, and sketch modalities to enhance human-likeness, and it outperformed existing baselines in human-likeness as validated by quantitative metrics and human evaluations.

Human-aligned AI is a critical component of co-creativity, as it enables models to accurately interpret human intent and generate controllable outputs that align with design goals in collaborative content creation. This direction is especially relevant in procedural content generation via reinforcement learning (PCGRL), which is intended to serve as a tool for human designers. However, existing systems often fall short of exhibiting human-centered behavior, limiting the practical utility of AI-driven generation tools in real-world design workflows. In this paper, we propose VIPCGRL (Vision-Instruction PCGRL), a novel deep reinforcement learning framework that incorporates three modalities-text, level, and sketches-to extend control modality and enhance human-likeness. We introduce a shared embedding space trained via quadruple contrastive learning across modalities and human-AI styles, and align the policy using an auxiliary reward based on embedding similarity. Experimental results show that VIPCGRL outperforms existing baselines in human-likeness, as validated by both quantitative metrics and human evaluations. The code and dataset will be available upon publication.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes