SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning
This addresses the problem of fragile, manually designed prompts in multi-agent systems for researchers and practitioners, though it appears incremental as it builds on existing multi-agent and reasoning methods.
The paper tackles the challenge of optimizing multi-agent AI systems by introducing SiriuS, a self-improving framework that builds an experience library of reasoning trajectories, which boosts performance by 2.86% to 21.88% on reasoning and biomedical QA tasks.
Multi-agent AI systems powered by large language models (LLMs) are increasingly applied to solve complex tasks. However, these systems often rely on fragile, manually designed prompts and heuristics, making optimization difficult. A key challenge in optimizing multi-agent systems is acquiring suitable training data for specialized agents. We introduce SiriuS, a self-improving, reasoning-driven optimization framework for multi-agent systems. Central to our approach is the construction of an experience library: a repository of high-quality reasoning trajectories. The library is built by retaining reasoning steps that lead to successful outcomes, providing a robust training set for optimizing multi-agent system. Additionally, we introduce a library augmentation procedure that refines unsuccessful trajectories, further enriching the library. SiriuS boosts performance by 2.86\% to 21.88\% on reasoning and biomedical QA and enhances agent negotiation in competitive settings. Our results show that SiriuS enhances multi-agent performance while generating reusable data for self-correction and self-play enhancement in the future.