ROAICLCVOct 9, 2025

NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions

arXiv:2510.08173v13 citationsh-index: 8
Originality Incremental advance
AI Analysis

This work addresses the problem of evaluating spatial intelligence in embodied navigation for AI and robotics researchers, though it is incremental as it builds on prior benchmarks by adding spatial focus.

The authors tackled the lack of systematic evaluation for spatial perception and reasoning in navigation agents by introducing the NavSpace benchmark with 1,228 trajectory-instruction pairs across six categories, and they proposed SNav, a new model that outperforms 22 existing agents on this benchmark and in real robot tests.

Instruction-following navigation is a key step toward embodied intelligence. Prior benchmarks mainly focus on semantic understanding but overlook systematically evaluating navigation agents' spatial perception and reasoning capabilities. In this work, we introduce the NavSpace benchmark, which contains six task categories and 1,228 trajectory-instruction pairs designed to probe the spatial intelligence of navigation agents. On this benchmark, we comprehensively evaluate 22 navigation agents, including state-of-the-art navigation models and multimodal large language models. The evaluation results lift the veil on spatial intelligence in embodied navigation. Furthermore, we propose SNav, a new spatially intelligent navigation model. SNav outperforms existing navigation agents on NavSpace and real robot tests, establishing a strong baseline for future work.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes