AIMASIGNSep 26, 2025

Reimagining Agent-based Modeling with Large Language Model Agents via Shachi

arXiv:2509.21862v20.10h-index: 4Has Code
AI Analysis85

This provides a rigorous, open-source foundation for building and evaluating LLM agents, aimed at fostering more cumulative and scientifically grounded research in multi-agent systems.

The authors tackled the challenge of studying emergent behaviors in LLM-driven multi-agent systems by introducing Shachi, a formal methodology and modular framework that decomposes agent policies into cognitive components. They validated it on a 10-task benchmark and demonstrated its external validity by modeling a real-world U.S. tariff shock, showing agent behaviors aligned with observed market reactions only with proper configuration.

The study of emergent behaviors in large language model (LLM)-driven multi-agent systems is a critical research challenge, yet progress is limited by a lack of principled methodologies for controlled experimentation. To address this, we introduce Shachi, a formal methodology and modular framework that decomposes an agent's policy into core cognitive components: Configuration for intrinsic traits, Memory for contextual persistence, and Tools for expanded capabilities, all orchestrated by an LLM reasoning engine. This principled architecture moves beyond brittle, ad-hoc agent designs and enables the systematic analysis of how specific architectural choices influence collective behavior. We validate our methodology on a comprehensive 10-task benchmark and demonstrate its power through novel scientific inquiries. Critically, we establish the external validity of our approach by modeling a real-world U.S. tariff shock, showing that agent behaviors align with observed market reactions only when their cognitive architecture is appropriately configured with memory and tools. Our work provides a rigorous, open-source foundation for building and evaluating LLM agents, aimed at fostering more cumulative and scientifically grounded research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes