AIJan 19, 2024

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

arXiv:2401.10568v239 citationsHas CodeICLR
AI Analysis

This work addresses the need for environments that test both learning and reasoning in AI agents, though it is incremental as it builds on existing game-based simulation frameworks.

The authors introduced CivRealm, a complex environment based on the Civilization game, to challenge decision-making agents with both learning and reasoning tasks, but initial results showed that both RL- and LLM-based agents struggled to make substantial progress in the full game.

The generalization of decision-making agents encompasses two fundamental elements: learning from past experiences and reasoning in novel contexts. However, the predominant emphasis in most interactive environments is on learning, often at the expense of complexity in reasoning. In this paper, we introduce CivRealm, an environment inspired by the Civilization game. Civilization's profound alignment with human history and society necessitates sophisticated learning, while its ever-changing situations demand strong reasoning to generalize. Particularly, CivRealm sets up an imperfect-information general-sum game with a changing number of players; it presents a plethora of complex features, challenging the agent to deal with open-ended stochastic environments that require diplomacy and negotiation skills. Within CivRealm, we provide interfaces for two typical agent types: tensor-based agents that focus on learning, and language-based agents that emphasize reasoning. To catalyze further research, we present initial results for both paradigms. The canonical RL-based agents exhibit reasonable performance in mini-games, whereas both RL- and LLM-based agents struggle to make substantial progress in the full game. Overall, CivRealm stands as a unique learning and reasoning challenge for decision-making agents. The code is available at https://github.com/bigai-ai/civrealm.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes