AIHCLGMADec 23, 2023

Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning

arXiv:2312.15160v16 citationsNeural computing & applications (Print)
Originality Incremental advance
AI Analysis

This addresses the challenge of effective human-AI teaming for critical infrastructure protection, though it is incremental by building on existing RL and HitL methods.

The paper tackles the problem of evaluating human-AI collaboration in complex environments, showing that it outperforms human-only or AI-only approaches in a drone defense simulation, with faster learning, lower human effort, and higher performance.

Recent advances in reinforcement learning (RL) and Human-in-the-Loop (HitL) learning have made human-AI collaboration easier for humans to team with AI agents. Leveraging human expertise and experience with AI in intelligent systems can be efficient and beneficial. Still, it is unclear to what extent human-AI collaboration will be successful, and how such teaming performs compared to humans or AI agents only. In this work, we show that learning from humans is effective and that human-AI collaboration outperforms human-controlled and fully autonomous AI agents in a complex simulation environment. In addition, we have developed a new simulator for critical infrastructure protection, focusing on a scenario where AI-powered drones and human teams collaborate to defend an airport against enemy drone attacks. We develop a user interface to allow humans to assist AI agents effectively. We demonstrated that agents learn faster while learning from policy correction compared to learning from humans or agents. Furthermore, human-AI collaboration requires lower mental and temporal demands, reduces human effort, and yields higher performance than if humans directly controlled all agents. In conclusion, we show that humans can provide helpful advice to the RL agents, allowing them to improve learning in a multi-agent setting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes