ROAIHCLGJun 29, 2023

Principles and Guidelines for Evaluating Social Robot Navigation Algorithms

CMUMIT
arXiv:2306.16740v4156 citationsh-index: 94
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of inconsistent evaluation in social navigation for robotics researchers, but it is incremental as it builds on existing benchmarking practices from other fields.

The paper tackles the challenge of evaluating social robot navigation algorithms by proposing principles, guidelines, and a metrics framework to enable fair and repeatable benchmarking, aiming to accelerate progress in the field.

A major challenge to deploying robots widely is navigation in human-populated environments, commonly referred to as social robot navigation. While the field of social navigation has advanced tremendously in recent years, the fair evaluation of algorithms that tackle social navigation remains hard because it involves not just robotic agents moving in static environments but also dynamic human agents and their perceptions of the appropriateness of robot behavior. In contrast, clear, repeatable, and accessible benchmarks have accelerated progress in fields like computer vision, natural language processing and traditional robot navigation by enabling researchers to fairly compare algorithms, revealing limitations of existing solutions and illuminating promising new directions. We believe the same approach can benefit social navigation. In this paper, we pave the road towards common, widely accessible, and repeatable benchmarking criteria to evaluate social robot navigation. Our contributions include (a) a definition of a socially navigating robot as one that respects the principles of safety, comfort, legibility, politeness, social competency, agent understanding, proactivity, and responsiveness to context, (b) guidelines for the use of metrics, development of scenarios, benchmarks, datasets, and simulators to evaluate social navigation, and (c) a design of a social navigation metrics framework to make it easier to compare results from different simulators, robots and datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes