SEAIDec 25, 2025

How Do Agents Perform Code Optimization? An Empirical Study

arXiv:2512.21757v12 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

This addresses the gap in knowledge about AI agents' effectiveness in performance optimization for software developers, but it is incremental as it focuses on empirical comparison without introducing new methods.

The study tackled the problem of understanding how AI coding agents perform on real-world performance optimization tasks by comparing agent- and human-authored commits, finding that AI-authored PRs are less likely to include explicit performance validation (45.7% vs. 63.6%, p=0.007) and largely use similar optimization patterns.

Performance optimization is a critical yet challenging aspect of software development, often requiring a deep understanding of system behavior, algorithmic tradeoffs, and careful code modifications. Although recent advances in AI coding agents have accelerated code generation and bug fixing, little is known about how these agents perform on real-world performance optimization tasks. We present the first empirical study comparing agent- and human-authored performance optimization commits, analyzing 324 agent-generated and 83 human-authored PRs from the AIDev dataset across adoption, maintainability, optimization patterns, and validation practices. We find that AI-authored performance PRs are less likely to include explicit performance validation than human-authored PRs (45.7\% vs. 63.6\%, $p=0.007$). In addition, AI-authored PRs largely use the same optimization patterns as humans. We further discuss limitations and opportunities for advancing agentic code optimization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes