AIGTHCROSep 12, 2021

A Socially Aware Reinforcement Learning Agent for The Single Track Road Problem

arXiv:2109.05486v3
Originality Incremental advance
AI Analysis

This addresses coordination challenges in human-AI interaction for autonomous systems, but it is incremental as it builds on existing reinforcement learning methods in a specific scenario.

The paper tackles the single-track road problem where a human and an autonomous agent must coordinate to pass each other, showing that a reinforcement learning agent maximizing a combination of human and its own utility achieves high performance and significantly outperforms baselines like a self-utility-only agent.

We present the single track road problem. In this problem two agents face each-other at opposite positions of a road that can only have one agent pass at a time. We focus on the scenario in which one agent is human, while the other is an autonomous agent. We run experiments with human subjects in a simple grid domain, which simulates the single track road problem. We show that when data is limited, building an accurate human model is very challenging, and that a reinforcement learning agent, which is based on this data, does not perform well in practice. However, we show that an agent that tries to maximize a linear combination of the human's utility and its own utility, achieves a high score, and significantly outperforms other baselines, including an agent that tries to maximize only its own utility.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes