CLAIMAJun 3

Streaming Communication in Multi-Agent Reasoning

arXiv:2606.0515894.4
Predicted impact top 14% in CL · last 90 daysOriginality Highly original
AI Analysis

For practitioners of multi-agent reasoning systems, this work offers a simple protocol change that simultaneously improves speed and accuracy, with a discovered step-level scaling law.

StreamMA reduces latency in multi-agent reasoning by streaming steps to downstream agents as they are generated, and improves accuracy by preventing error-prone late steps from misleading downstream agents. Across eight benchmarks, it outperforms baselines by an average of 7.3 percentage points (up to 22.4 pp on HMMT 2026).

Multi-agent reasoning systems adopt a "generate-then-transfer" paradigm that forces end-to-end latency to scale linearly with pipeline depth. We introduce StreamMA, a multi-agent reasoning system that streams each reasoning step to downstream agents as soon as it is generated, pipelining adjacent agents and thus reducing latency. Surprisingly, this pipelining also improves effectiveness: because multi-step reasoning quality is non-uniform and early steps are more reliable than later ones, working with these reliable early steps instead of the full chain prevents error-prone late steps from misleading downstream agents. We formalize both advantages with the first closed-form joint analysis of stream, serial, and single protocols, deriving the effectiveness ordering, speedup upper bound, and cost ratio. Across eight reasoning benchmarks spanning mathematics, science, and code, two frontier LLMs (Claude Opus 4.6 and GPT-5.4), and three topologies (Chain, Tree, Graph), StreamMA outperforms both baselines (avg. +7.3 pp, max +22.4 pp on HMMT 2026; Claude Opus 4.6-high). Beyond these contributions, we discover a "step-level scaling law": increasing per-agent steps consistently improves both effectiveness and efficiency, a new scaling dimension orthogonal to and composable with agent-count scaling.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes