Reachability-Augmented Dual Dynamic Programming for Optimal Path Parameterization

Yunan Wang, Jizhou Yan, Chuxiong Hu, Zeyang Li

arXiv:2605.1908918.3

AI Analysis

For robotics and control practitioners, RDDP unifies feasibility guarantees, optimality certificates, and computational efficiency in a single framework for optimal path parameterization, addressing a key capability gap.

The paper proposes Reachability-Augmented Dual Dynamic Programming (RDDP) for optimal path parameterization, achieving objective values comparable to convex-optimization baselines while reducing computation time by 28.6× for OPP2 and 5.8× for OPP3, and providing global optimality guarantees for convex OPP and KKT convergence for non-convex OPP.

Optimal path parameterization (OPP) is a fundamental problem for planning trajectories along a prescribed geometric path under kinodynamic constraints and task-dependent objectives. While TOPP minimizes traversal time, its saturating states and controls may induce vibration and tracking errors, which can be mitigated by introducing smoothness objectives. However, a key capability gap remains in OPP: feasibility guarantees, general-objective optimality certificates, and computational efficiency are difficult to achieve simultaneously in a unified framework, especially for third-order OPP (OPP3) with non-convex constraints. This paper proposes reachability-augmented dual dynamic programming (RDDP), a state-grid-free and objective-aware DP framework for OPP. The key idea is to replace the relatively complete recourse assumption used in classical dual DP (DDP) with OPP-specific backward reachable sets, and then generate both value-function cuts and trial trajectories only inside these reachable sets. For convex and non-convex OPP, we prove global optimality and Karush-Kuhn-Tucker convergence of RDDP under OPP-specific conditions, respectively. Efficient instantiations are developed for OPP2 and OPP3. Experiments show that RDDP achieves objective values comparable to convex-optimization baselines while reducing computation time by 28.6 times for OPP2 and 5.8 times for OPP3. RDDP also achieves faster convergence than grid-based DP. Compared with reachability-analysis methods, RDDP retains the reachability mechanism while replacing local maximum-control propagation with value-function-guided control selection, thereby enabling objectives beyond traversal time. In summary, RDDP addresses a key capability gap in OPP by unifying certifiable general-objective optimization, reachability-based feasibility preservation, and online-compatible low-dimensional DP computation in a single OPP framework.

View on arXiv PDF

Similar