AIOct 2, 2025

Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge

arXiv:2510.02557v15 citationsh-index: 4Has CodeProceedings of the 2025 The Seventh International Conference on Distributed Artificial Intelligence
Originality Synthesis-oriented
AI Analysis

It addresses the challenge of orchestrating collaboration in dynamic human-AI teams, which is an incremental step in agentic AI.

This paper tackles the problem of managing complex multi-agent workflows in human-AI teams by proposing the Autonomous Manager Agent as a research challenge, and finds that GPT-5-based agents struggle to optimize goal completion, constraint adherence, and runtime across 20 workflows.

While agentic AI has advanced in automating individual tasks, managing complex multi-agent workflows remains a challenging problem. This paper presents a research vision for autonomous agentic systems that orchestrate collaboration within dynamic human-AI teams. We propose the Autonomous Manager Agent as a core challenge: an agent that decomposes complex goals into task graphs, allocates tasks to human and AI workers, monitors progress, adapts to changing conditions, and maintains transparent stakeholder communication. We formalize workflow management as a Partially Observable Stochastic Game and identify four foundational challenges: (1) compositional reasoning for hierarchical decomposition, (2) multi-objective optimization under shifting preferences, (3) coordination and planning in ad hoc teams, and (4) governance and compliance by design. To advance this agenda, we release MA-Gym, an open-source simulation and evaluation framework for multi-agent workflow orchestration. Evaluating GPT-5-based Manager Agents across 20 workflows, we find they struggle to jointly optimize for goal completion, constraint adherence, and workflow runtime - underscoring workflow management as a difficult open problem. We conclude with organizational and ethical implications of autonomous management systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes