AI HCApr 7

Context-Value-Action Architecture for Value-Driven Large Language Model Agents

TianZe Zhang, Sirui Sun, Yuhang Xie, Xin Zhang, Zhiqiang Wu, Guojie Song

arXiv:2604.0593989.3

AI Analysis

This addresses the issue of unrealistic and polarized behavior in LLM agents for applications requiring human-like simulation, representing a novel method rather than an incremental improvement.

The paper tackles the problem of behavioral rigidity and value polarization in LLM-based agents, proposing the Context-Value-Action (CVA) architecture, which significantly outperforms baselines on CVABench with over 1.1 million real-world interaction traces by mitigating polarization and improving fidelity and interpretability.

Large Language Models (LLMs) have shown promise in simulating human behavior, yet existing agents often exhibit behavioral rigidity, a flaw frequently masked by the self-referential bias of current "LLM-as-a-judge" evaluations. By evaluating against empirical ground truth, we reveal a counter-intuitive phenomenon: increasing the intensity of prompt-driven reasoning does not enhance fidelity but rather exacerbates value polarization, collapsing population diversity. To address this, we propose the Context-Value-Action (CVA) architecture, grounded in the Stimulus-Organism-Response (S-O-R) model and Schwartz's Theory of Basic Human Values. Unlike methods relying on self-verification, CVA decouples action generation from cognitive reasoning via a novel Value Verifier trained on authentic human data to explicitly model dynamic value activation. Experiments on CVABench, which comprises over 1.1 million real-world interaction traces, demonstrate that CVA significantly outperforms baselines. Our approach effectively mitigates polarization while offering superior behavioral fidelity and interpretability.

View on arXiv PDF

Similar