OCLGSYOct 7, 2025

Differentiable Model Predictive Control on the GPU

arXiv:2510.06179v16 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses a computational bottleneck for researchers and practitioners in robotics and control, enabling faster training in reinforcement and imitation learning, though it is incremental as it builds on existing MPC methods.

The paper tackled the bottleneck of sequential optimization in differentiable model predictive control (MPC) by introducing a GPU-accelerated solver, achieving substantial speedups over CPU- and GPU-based baselines and improving state-of-the-art training times on benchmark tasks.

Differentiable model predictive control (MPC) offers a powerful framework for combining learning and control. However, its adoption has been limited by the inherently sequential nature of traditional optimization algorithms, which are challenging to parallelize on modern computing hardware like GPUs. In this work, we tackle this bottleneck by introducing a GPU-accelerated differentiable optimization tool for MPC. This solver leverages sequential quadratic programming and a custom preconditioned conjugate gradient (PCG) routine with tridiagonal preconditioning to exploit the problem's structure and enable efficient parallelization. We demonstrate substantial speedups over CPU- and GPU-based baselines, significantly improving upon state-of-the-art training times on benchmark reinforcement learning and imitation learning tasks. Finally, we showcase the method on the challenging task of reinforcement learning for driving at the limits of handling, where it enables robust drifting of a Toyota Supra through water puddles.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes