ROAILGOct 30, 2025

Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments

arXiv:2510.26646v11 citationsh-index: 1
Originality Incremental advance
AI Analysis

This work addresses path-planning for robots in dynamic settings, offering incremental improvements through a hybrid reinforcement learning framework.

The paper tackles autonomous navigation in dynamic environments by combining DQN for high-level sub-goal selection and TD3 for low-level control, resulting in improved success rates and sample efficiency over baselines like single-algorithm approaches and rule-based planners.

This paper presents a hierarchical path-planning and control framework that combines a high-level Deep Q-Network (DQN) for discrete sub-goal selection with a low-level Twin Delayed Deep Deterministic Policy Gradient (TD3) controller for continuous actuation. The high-level module selects behaviors and sub-goals; the low-level module executes smooth velocity commands. We design a practical reward shaping scheme (direction, distance, obstacle avoidance, action smoothness, collision penalty, time penalty, and progress), together with a LiDAR-based safety gate that prevents unsafe motions. The system is implemented in ROS + Gazebo (TurtleBot3) and evaluated with PathBench metrics, including success rate, collision rate, path efficiency, and re-planning efficiency, in dynamic and partially observable environments. Experiments show improved success rate and sample efficiency over single-algorithm baselines (DQN or TD3 alone) and rule-based planners, with better generalization to unseen obstacle configurations and reduced abrupt control changes. Code and evaluation scripts are available at the project repository.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes