AIHCApr 7

Context-Value-Action Architecture for Value-Driven Large Language Model Agents

arXiv:2604.0593989.3
AI Analysis

This addresses the issue of unrealistic and polarized behavior in LLM agents for applications requiring human-like simulation, representing a novel method rather than an incremental improvement.

The paper tackles the problem of behavioral rigidity and value polarization in LLM-based agents, proposing the Context-Value-Action (CVA) architecture, which significantly outperforms baselines on CVABench with over 1.1 million real-world interaction traces by mitigating polarization and improving fidelity and interpretability.

Large Language Models (LLMs) have shown promise in simulating human behavior, yet existing agents often exhibit behavioral rigidity, a flaw frequently masked by the self-referential bias of current "LLM-as-a-judge" evaluations. By evaluating against empirical ground truth, we reveal a counter-intuitive phenomenon: increasing the intensity of prompt-driven reasoning does not enhance fidelity but rather exacerbates value polarization, collapsing population diversity. To address this, we propose the Context-Value-Action (CVA) architecture, grounded in the Stimulus-Organism-Response (S-O-R) model and Schwartz's Theory of Basic Human Values. Unlike methods relying on self-verification, CVA decouples action generation from cognitive reasoning via a novel Value Verifier trained on authentic human data to explicitly model dynamic value activation. Experiments on CVABench, which comprises over 1.1 million real-world interaction traces, demonstrate that CVA significantly outperforms baselines. Our approach effectively mitigates polarization while offering superior behavioral fidelity and interpretability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes