ROMar 23

Data Scaling for Navigation in Unknown Environments

arXiv:2601.0944476.72 citationsh-index: 16
AI Analysis

This addresses the problem of real-world generalization for autonomous navigation in unknown environments, with incremental improvements in understanding data scaling effects.

The study tackled the challenge of generalizing imitation-learned navigation policies to unseen environments by analyzing the impact of data quantity and diversity, finding that data diversity reduces navigation errors by ~15% per doubling of geographical locations, enabling zero-shot navigation performance close to environment-specific training.

Generalization of imitation-learned navigation policies to environments unseen in training remains a major challenge. We address this by conducting the first large-scale study of how data quantity and data diversity affect real-world generalization in end-to-end, map-free visual navigation. Using a curated 4,565-hour crowd-sourced dataset collected across 161 locations in 35 countries, we train policies for point goal navigation and evaluate their closed-loop control performance on sidewalk robots operating in four countries, covering 125 km of autonomous driving. Our results show that large-scale training data enables zero-shot navigation in unknown environments, approaching the performance of policies trained with environment-specific demonstrations. Critically, we find that data diversity is far more important than data quantity. Doubling the number of geographical locations in a training set decreases navigation errors by ~15%, while performance benefit from adding data from existing locations saturates with very little data. We also observe that, with noisy crowd-sourced data, simple regression-based models outperform generative and sequence-based architectures. We release our policies, evaluation setup and example videos at https://lasuomela.github.io/navigation_scaling/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes