Deep Active Inference with Diffusion Policy and Multiple Timescale World Model for Real-World Exploration and Navigation
This work addresses the problem of real-world robotic navigation by unifying exploration and goal-directed behaviors, representing an incremental improvement over existing active inference methods.
The paper tackled autonomous robotic navigation by proposing a deep active inference framework that integrates a diffusion policy and a multiple timescale world model, achieving higher success rates and fewer collisions in real-world experiments compared to baselines.
Autonomous robotic navigation in real-world environments requires exploration to acquire environmental information as well as goal-directed navigation in order to reach specified targets. Active inference (AIF) based on the free-energy principle provides a unified framework for these behaviors by minimizing the expected free energy (EFE), thereby combining epistemic and extrinsic values. To realize this practically, we propose a deep AIF framework that integrates a diffusion policy as the policy model and a multiple timescale recurrent state-space model (MTRSSM) as the world model. The diffusion policy generates diverse candidate actions while the MTRSSM predicts their long-horizon consequences through latent imagination, enabling action selection that minimizes EFE. Real-world navigation experiments demonstrated that our framework achieved higher success rates and fewer collisions compared with the baselines, particularly in exploration-demanding scenarios. These results highlight how AIF based on EFE minimization can unify exploration and goal-directed navigation in real-world robotic settings.