LG MLFeb 13, 2025

Learning to Coordinate with Experts

Mohamad H. Danesh, Nguyen X. Khanh, Tu Trinh, Benjamin Plaut

arXiv:2502.09583v24.1h-index: 5Has CodeTrans. Mach. Learn. Res.

Originality Synthesis-oriented

AI Analysis

This addresses the challenge of cost-effective expert collaboration for AI agents in real-world scenarios, presenting an incremental contribution by formalizing a new problem variant and providing tools for evaluation.

The paper tackles the problem of AI agents learning to coordinate with experts in new environments without expert interaction during training, introducing the YRC-0 setting and YRC-Bench benchmark to support research in this area.

When deployed in the real world, AI agents will inevitably face challenges that exceed their individual capabilities. Leveraging assistance from experts, whether humans or highly capable AI systems, can significantly improve both safety and performance in such situations. Since expert assistance is costly, a central challenge is determining when to consult an expert. In this paper, we explore a novel variant of this problem, termed YRC-0, in which an agent must learn to collaborate with an expert in new environments in an unsupervised manner--that is, without interacting with the expert during training. This setting motivates the development of low-cost, robust approaches for training expert-leveraging agents. To support research in this area, we introduce YRC-Bench, an open-source benchmark that instantiates YRC-0 across diverse environments. YRC-Bench provides a standardized Gym-like API, simulated experts, an evaluation pipeline, and implementations of popular baselines. Toward tackling YRC-0, we propose a validation strategy and evaluate a range of learning methods, offering insights that can inform future research. Codebase: github.com/modanesh/YRC-Bench

View on arXiv PDF Code

Similar