Time, Identity and Consciousness in Language Model Agents
This provides a method for assessing identity in AI agents, which is incremental as it builds on existing Stack Theory concepts.
The paper tackles the problem of evaluating machine consciousness in language model agents by separating behavioral claims from underlying organizational stability, resulting in a toolkit that quantifies identity persistence through two scores and maps scaffolds into an identity morphospace.
Machine consciousness evaluations mostly see behavior. For language model agents that behavior is language and tool use. That lets an agent say the right things about itself even when the constraints that should make those statements matter are not jointly present at decision time. We apply Stack Theory's temporal gap to scaffold trajectories. This separates ingredient-wise occurrence within an evaluation window from co-instantiation at a single objective step. We then instantiate Stack Theory's Arpeggio and Chord postulates on grounded identity statements. This yields two persistence scores that can be computed from instrumented scaffold traces. We connect these scores to five operational identity metrics and map common scaffolds into an identity morphospace that exposes predictable tradeoffs. The result is a conservative toolkit for identity evaluation. It separates talking like a stable self from being organized like one.