SEHCLGApr 20

Scaling Human-AI Coding Collaboration Requires a Governable Consensus Layer

arXiv:2604.1788383.4h-index: 9
AI Analysis

For developers and AI-assisted coding tools, this addresses the control failure in current AI coding workflows by introducing a governable consensus layer to improve auditability and system robustness.

Vibe coding produces correct code but lacks structural records, making systems opaque and fragile. The authors propose Agentic Consensus, a paradigm using a typed property graph as the primary artifact, with synchronization operators to keep executable artifacts in correspondence, aiming to reduce human intervention compared to chat-driven baselines.

Vibe coding produces correct, executable code at speed, but leaves no record of the structural commitments, dependencies, or evidence behind it. Reviewers cannot determine what invariants were assumed, what changed, or why a regression occurred. This is not a generation failure but a control failure: the dominant artifact of AI-assisted development (code plus chat history) performs dimension collapse, flattening complex system topology into low-dimensional text and making systems opaque and fragile under change. We propose Agentic Consensus: a paradigm in which the consensus layer C, an operable world model represented as a typed property graph, replaces code as the primary artifact of engineering. Executable artifacts are derived from C and kept in correspondence via synchronization operators Phi (realize) and Psi (rehydrate). Evidence links directly to structural claims in C, making every commitment auditable and under-specification explicit as measurable consensus entropy rather than a silent guess. Evaluation must move beyond code correctness toward alignment fidelity, consensus entropy, and intervention distance. We propose benchmark task families designed to measure whether consensus-based workflows reduce human intervention compared to chat-driven baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes