SEAIJul 31, 2025

Trae Agent: An LLM-based Agent for Software Engineering with Test-time Scaling

Peking U
arXiv:2507.23370v180 citationsh-index: 43Has Code
Originality Incremental advance
AI Analysis

This work solves the problem of enhancing software engineering efficiency for developers by providing a more effective LLM-based agent, though it is incremental as it builds on existing ensemble reasoning techniques.

The paper tackles software issue resolution by proposing Trae Agent, an agent-based ensemble reasoning approach that addresses limitations in exploring large ensemble spaces and repository-level understanding, achieving a 10.22% average improvement in Pass@1 over baselines and a top score of 75.20% on the SWE-bench benchmark.

Software issue resolution is a critical challenge in software engineering and has garnered increasing attention in recent years. With the rapid advancement of large language models (LLMs), substantial progress has been made in addressing real-world software engineering tasks. Recent studies have introduced ensemble reasoning techniques to enhance the performance of LLM-based issue resolution. However, existing prompting-based methods still face limitations in effectively exploring large ensemble spaces and lack the capacity for repository-level understanding, both of which constrain their overall effectiveness. In this paper, we propose Trae Agent, the first agent-based ensemble reasoning approach for repository-level issue resolution. Trae Agent formulates our goal as an optimal solution search problem and addresses two key challenges, i.e., large ensemble spaces and repository-level understanding, through modular agents for generation, pruning, and selection. We conduct extensive experiments using three leading LLMs on the widely-adopted SWE-bench benchmark, comparing Trae Agent against four state-of-the-art ensemble reasoning techniques. Experimental results demonstrate that Trae Agent consistently achieves superior performance, with an average improvement of 10.22% over all baselines in terms of Pass@1. Trae Agent has achieved first place on the SWE-bench Verified leaderboard, with a notable Pass@1 score of 75.20%. We are pleased to release Trae Agent as an open-source project to support the research community, with all resources available at https://github.com/bytedance/trae-agent.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes