AINEROOct 12, 2025

The Irrational Machine: Neurosis and the Limits of Algorithmic Safety

arXiv:2510.10823v1
Originality Incremental advance
AI Analysis

This addresses safety issues in AI systems for researchers and developers, though it is incremental as it builds on existing safety concepts like the Three Laws.

The paper tackles the problem of neurotic behaviors in embodied AI, such as irrational detours and policy oscillations, by developing a framework to characterize them and proposing genetic-programming based destructive testing to expose global safety failures, resulting in adversarial curricula and counterfactual traces for architectural improvements.

We present a framework for characterizing neurosis in embodied AI: behaviors that are internally coherent yet misaligned with reality, arising from interactions among planning, uncertainty handling, and aversive memory. In a grid navigation stack we catalogue recurrent modalities including flip-flop, plan churn, perseveration loops, paralysis and hypervigilance, futile search, belief incoherence, tie break thrashing, corridor thrashing, optimality compulsion, metric mismatch, policy oscillation, and limited-visibility variants. For each we give lightweight online detectors and reusable escape policies (short commitments, a margin to switch, smoothing, principled arbitration). We then show that durable phobic avoidance can persist even under full visibility when learned aversive costs dominate local choice, producing long detours despite globally safe routes. Using First/Second/Third Law as engineering shorthand for safety latency, command compliance, and resource efficiency, we argue that local fixes are insufficient; global failures can remain. To surface them, we propose genetic-programming based destructive testing that evolves worlds and perturbations to maximize law pressure and neurosis scores, yielding adversarial curricula and counterfactual traces that expose where architectural revision, not merely symptom-level patches, is required.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes