ROMar 18

Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation

arXiv:2603.1751017.5h-index: 9
Predicted impact top 78% in RO · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses the challenge of making robots more adaptable and transparent in human-shared environments, representing an incremental improvement by integrating existing models into a novel pipeline.

The paper tackles the problem of integrating context-dependent human preferences into robot navigation by developing a pipeline that combines foundational models with multi-objective reinforcement learning, resulting in a system that reliably captures user intent and enables controllable behavior adaptation across diverse contexts.

Robots operating in human-shared environments must not only achieve task-level navigation objectives such as safety and efficiency, but also adapt their behavior to human preferences. However, as human preferences are typically expressed in natural language and depend on environmental context, it is difficult to directly integrate them into low-level robot control policies. In this work, we present a pipeline that enables robots to understand and apply context-dependent navigation preferences by combining foundational models with a Multi-Objective Reinforcement Learning (MORL) navigation policy. Thus, our approach integrates high-level semantic reasoning with low-level motion control. A Vision-Language Model (VLM) extracts structured environmental context from onboard visual observations, while Large Language Models (LLM) convert natural language user feedback into interpretable, context-dependent behavioral rules stored in a persistent but updatable rule memory. A preference translation module then maps contextual information and stored rules into numerical preference vectors that parameterize a pretrained MORL policy for real-time navigation adaptation. We evaluate the proposed framework through quantitative component-level evaluations, a user study, and real-world robot deployments in various indoor environments. Our results demonstrate that the system reliably captures user intent, generates consistent preference vectors, and enables controllable behavior adaptation across diverse contexts. Overall, the proposed pipeline improves the adaptability, transparency, and usability of robots operating in shared human environments, while maintaining safe and responsive real-time control.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes