MA · AI · Jul 11, 2025

Optimizing Sequential Multi-Step Tasks with Parallel LLM Agents

Microsoft
arXiv:2507.08944v1 · 8 citations · h-index: 15
Originality: Incremental advance
AI Analysis

The paper addresses latency in multi-agent systems for real-world, high-complexity reasoning tasks, and represents an incremental improvement.

The paper tackles the high latency of LLM-based multi-agent systems on complex tasks by proposing M1-Parallel, a framework that runs multiple agent teams in parallel. With early termination it achieves up to a 2.2x speedup while preserving accuracy; with aggregation it yields higher task completion rates.

Large language model (LLM)-based multi-agent systems have demonstrated remarkable promise for tackling complex tasks by breaking them down into subtasks that are iteratively planned, executed, observed, and refined. Despite their effectiveness, these systems often incur high latency because real-world problems frequently demand multiple iterative cycles of reasoning steps. To address this challenge, we propose M1-Parallel, a framework that concurrently runs multiple multi-agent teams in parallel to uncover distinct solution paths. By leveraging an event-driven communication model with asynchronous messaging, M1-Parallel efficiently capitalizes on the inherent diversity of valid plans to either reduce end-to-end latency or boost task completion rates. Our experiments on complex tasks show that M1-Parallel with early termination achieves up to $2.2\times$ speedup while preserving accuracy, and that M1-Parallel with aggregation yields higher task completion rates. We further investigate strategies aimed at encouraging diverse execution plans but observe no additional performance gains over repeated sampling. Overall, these findings underscore the potential of parallel plan execution for optimizing multi-agent systems for real-world, high-complexity reasoning tasks.
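The two modes described in the abstract, early termination and aggregation, can be illustrated with a minimal sketch. This is not the authors' implementation: `run_team` is a hypothetical stand-in that simulates a multi-agent team with a fixed delay and a canned answer, and the majority vote in `aggregate` is an assumed, simplified substitute for whatever aggregation step the paper actually uses.

```python
import asyncio
from collections import Counter


async def run_team(task: str, delay: float, answer: str) -> str:
    """Hypothetical stand-in for one multi-agent team working on `task`.

    A real team would plan, act, and observe in a loop; here the work is
    simulated by sleeping for `delay` seconds and returning a canned answer.
    """
    await asyncio.sleep(delay)
    return answer


async def early_termination(team_coros) -> str:
    """Run all teams concurrently and return the first answer to arrive."""
    tasks = [asyncio.create_task(c) for c in team_coros]
    done, pending = await asyncio.wait(tasks, return_when=asyncio.FIRST_COMPLETED)
    # Cancel the slower teams so they stop consuming resources.
    for t in pending:
        t.cancel()
    # Wait for the cancellations to settle; CancelledError is collected, not raised.
    await asyncio.gather(*pending, return_exceptions=True)
    return next(iter(done)).result()


async def aggregate(team_coros) -> str:
    """Run all teams to completion and combine their answers.

    Assumption: a simple majority vote stands in for the aggregation step.
    """
    answers = await asyncio.gather(*team_coros)
    return Counter(answers).most_common(1)[0][0]


if __name__ == "__main__":
    def make_teams():
        # Three simulated teams with different latencies and answers.
        return [
            run_team("demo task", 0.05, "A"),
            run_team("demo task", 0.01, "B"),
            run_team("demo task", 0.09, "A"),
        ]

    print("early termination:", asyncio.run(early_termination(make_teams())))
    print("aggregation:", asyncio.run(aggregate(make_teams())))
```

The sketch shows the trade-off the abstract names: early termination returns as soon as the fastest team finishes, so latency is bounded by the quickest plan, while aggregation waits for every team and so pays the latency of the slowest one in exchange for combining several answers.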

