ROAISYJul 9, 2025

SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments

arXiv:2507.06564v114 citationsh-index: 7IROS
Originality Incremental advance
AI Analysis

This work addresses the challenge of enhancing UAV autonomy for applications in sectors like surveillance or delivery, though it appears incremental as it combines existing VLN and NMPC methods.

The paper tackles the problem of autonomous UAV navigation in complex urban environments by integrating vision-and-language navigation with nonlinear model predictive control, resulting in significantly improved navigation success rates and efficiency, especially in unseen environments.

Unmanned Aerial Vehicles (UAVs) have emerged as versatile tools across various sectors, driven by their mobility and adaptability. This paper introduces SkyVLN, a novel framework integrating vision-and-language navigation (VLN) with Nonlinear Model Predictive Control (NMPC) to enhance UAV autonomy in complex urban environments. Unlike traditional navigation methods, SkyVLN leverages Large Language Models (LLMs) to interpret natural language instructions and visual observations, enabling UAVs to navigate through dynamic 3D spaces with improved accuracy and robustness. We present a multimodal navigation agent equipped with a fine-grained spatial verbalizer and a history path memory mechanism. These components allow the UAV to disambiguate spatial contexts, handle ambiguous instructions, and backtrack when necessary. The framework also incorporates an NMPC module for dynamic obstacle avoidance, ensuring precise trajectory tracking and collision prevention. To validate our approach, we developed a high-fidelity 3D urban simulation environment using AirSim, featuring realistic imagery and dynamic urban elements. Extensive experiments demonstrate that SkyVLN significantly improves navigation success rates and efficiency, particularly in new and unseen environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes