RO AIDec 3, 2025

World Models for Autonomous Navigation of Terrestrial Robots from LIDAR Observations

Raul Steinmetz, Fabio Demo Rosa, Victor Augusto Kich, Jair Augusto Bottega, Ricardo Bedin Grando, Daniel Fernando Tello Gamarra

arXiv:2512.03429v13.21 citationsh-index: 11Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology

Originality Incremental advance

AI Analysis

This addresses the problem of sample inefficiency and high-dimensional sensor processing for robotics researchers, offering an incremental improvement over existing methods.

The paper tackles autonomous navigation for terrestrial robots using LIDAR data by proposing a model-based RL framework with a world model and latent encoding, achieving a 100% success rate in simulations compared to model-free baselines that plateaued below 85%.

Autonomous navigation of terrestrial robots using Reinforcement Learning (RL) from LIDAR observations remains challenging due to the high dimensionality of sensor data and the sample inefficiency of model-free approaches. Conventional policy networks struggle to process full-resolution LIDAR inputs, forcing prior works to rely on simplified observations that reduce spatial awareness and navigation robustness. This paper presents a novel model-based RL framework built on top of the DreamerV3 algorithm, integrating a Multi-Layer Perceptron Variational Autoencoder (MLP-VAE) within a world model to encode high-dimensional LIDAR readings into compact latent representations. These latent features, combined with a learned dynamics predictor, enable efficient imagination-based policy optimization. Experiments on simulated TurtleBot3 navigation tasks demonstrate that the proposed architecture achieves faster convergence and higher success rate compared to model-free baselines such as SAC, DDPG, and TD3. It is worth emphasizing that the DreamerV3-based agent attains a 100% success rate across all evaluated environments when using the full dataset of the Turtlebot3 LIDAR (360 readings), while model-free methods plateaued below 85%. These findings demonstrate that integrating predictive world models with learned latent representations enables more efficient and robust navigation from high-dimensional sensory data.

View on arXiv PDF

Similar