SEAINov 6, 2025

Does AI-Assisted Coding Deliver? A Difference-in-Differences Study of Cursor's Impact on Software Projects

arXiv:2511.04427v216 citationsh-index: 4
AI Analysis

This addresses the lack of empirical evidence for productivity claims in software engineering, providing insights for practitioners, tool designers, and researchers, though it is incremental in evaluating an existing tool.

The study tackled the problem of measuring the causal impact of AI-assisted coding tools like Cursor on software projects, finding that adoption leads to a significant but transient increase in development velocity and a persistent rise in static analysis warnings and code complexity, which in turn causes long-term velocity slowdown.

Large language models (LLMs) have demonstrated the promise to revolutionize the field of software engineering. Among other things, LLM agents are rapidly gaining momentum in their application to software development, with practitioners claiming a multifold productivity increase after adoption. Yet, empirical evidence is lacking around these claims. In this paper, we estimate the causal effect of adopting a widely popular LLM agent assistant, namely Cursor, on development velocity and software quality. The estimation is enabled by a state-of-the-art difference-in-differences design comparing Cursor-adopting GitHub projects with a matched control group of similar GitHub projects that do not use Cursor. We find that the adoption of Cursor leads to a significant, large, but transient increase in project-level development velocity, along with a significant and persistent increase in static analysis warnings and code complexity. Further panel generalized method of moments estimation reveals that the increase in static analysis warnings and code complexity acts as a major factor causing long-term velocity slowdown. Our study carries implications for software engineering practitioners, LLM agent assistant designers, and researchers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes