SE AIDec 1, 2025

Beyond Greenfield: The D3 Framework for AI-Driven Productivity in Brownfield Engineering

arXiv:2512.01155v2

Originality Incremental advance

AI Analysis

This addresses productivity issues for software practitioners working with legacy systems, though it is incremental as it builds on existing LLM methods.

The paper tackles the challenge of using large language models (LLMs) in brownfield engineering with legacy systems by introducing the D3 Framework, a structured workflow that led to self-reported productivity gains of 26.9%, reduced cognitive load for 77% of participants, and less time spent fixing code for 83% of participants.

Brownfield engineering work involving legacy systems, incomplete documentation, and fragmented architectural knowledge poses unique challenges for the effective use of large language models (LLMs). Prior research has largely focused on greenfield or synthetic tasks, leaving a gap in structured workflows for complex, context-heavy environments. This paper introduces the Discover-Define-Deliver (D3) Framework, a disciplined LLM-assisted workflow that combines role-separated prompting strategies with applied best practices for navigating ambiguity in brownfield systems. The framework incorporates a dual-agent prompting architecture in which a Builder model generates candidate outputs and a Reviewer model provides structured critique to improve reliability. I conducted an exploratory survey study with 52 software practitioners who applied the D3 workflow to real-world engineering tasks such as legacy system exploration, documentation reconstruction, and architectural refactoring. Respondents reported perceived improvements in task clarity, documentation quality, and cognitive load, along with self-estimated productivity gains. In this exploratory study, participants reported a weighted average productivity improvement of 26.9%, reduced cognitive load for approximately 77% of participants, and 83% of participants spent less time fixing or rewriting code due to better initial planning with AI. As these findings are self-reported and not derived from controlled experiments, they should be interpreted as preliminary evidence of practitioner sentiment rather than causal effects. The results highlight both the potential and limitations of structured LLM workflows for legacy engineering systems and motivate future controlled evaluations.

View on arXiv PDF

Similar