MAAICLJan 15

Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems

arXiv:2601.10560v15 citationsh-index: 7Has Code
Originality Incremental advance
AI Analysis

This work addresses latency issues for scalable multi-agent systems in time-sensitive applications, representing an incremental improvement over existing methods.

The paper tackles the high inference latency problem in multi-agent systems by proposing a latency-aware orchestration framework that enables parallel execution, reducing critical path length by 38-46% compared to state-of-the-art baselines while maintaining task performance.

Multi-agent systems (MAS) enable complex reasoning by coordinating multiple agents, but often incur high inference latency due to multi-step execution and repeated model invocations, severely limiting their scalability and usability in time-sensitive scenarios. Most existing approaches primarily optimize task performance and inference cost, and explicitly or implicitly assume sequential execution, making them less optimal for controlling latency under parallel execution. In this work, we investigate learning-based orchestration of multi-agent systems with explicit latency supervision under parallel execution. We propose Latency-Aware Multi-agent System (LAMaS), a latency-aware multi-agent orchestration framework that enables parallel execution and explicitly optimizes the critical execution path, allowing the controller to construct execution topology graphs with lower latency under parallel execution. Our experiments show that our approach reduces critical path length by 38-46% compared to the state-of-the-art baseline for multi-agent architecture search across multiple benchmarks, while maintaining or even improving task performance. These results highlight the importance of explicitly optimizing latency under parallel execution when designing efficient multi-agent systems. The code is available at https://github.com/xishi404/LAMaS

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes