SE AIApr 29

TDD Governance for Multi-Agent Code Generation via Prompt Engineering

Tarlan Hasanli, Shahbaz Siddeeq, Bishwash Khanal, Pyry Kotilainen, Tommi Mikkonen, Pekka Abrahamsson

arXiv:2604.2661553.0

Predicted impact top 51% in SE · last 90 daysOriginality Synthesis-oriented

AI Analysis

For developers using LLMs for code generation, this work provides a structured approach to enforce software engineering discipline, though it is an incremental extension of existing TDD and prompt engineering techniques.

The paper introduces an AI-native TDD framework that enforces classical TDD principles as structured governance mechanisms in multi-agent code generation, improving stability and reproducibility over unconstrained LLM workflows.

Large language models (LLMs) accelerate software development but often exhibit instability, non-determinism, and weak adherence to development discipline in unconstrained workflows. While test-driven development (TDD) provides a structured Red-Green-Refactor process, existing LLM-based approaches typically use tests as auxiliary inputs rather than enforceable process constraints. We present an AI-native TDD framework that operationalizes classical TDD principles as structured prompt-level and workflow-level governance mechanisms. Extracted principles are formalized in a machine-readable manifesto and distributed across planning, generation, repair, and validation stages within a layered architecture that separates model proposal from deterministic engine authority. The system enforces phase ordering, bounded repair loops, validation gates, and atomic mutation control to improve stability and reproducibility. We describe architecture and discuss encoding software engineering discipline directly into prompt orchestration, which we think offers a promising direction for reliable LLM-assisted development.

View on arXiv PDF

Similar