Generalized Model Predictive Path Integral Control as Expectation--Maximization
For researchers in stochastic optimal control and robotics, this work provides a theoretical foundation and generalization of MPPI, but the results are incremental as they reinterpret existing methods.
The paper shows that Model Predictive Path Integral (MPPI) control is a special case of Expectation-Maximization (EM) applied to probabilistic optimal control, and generalizes MPPI beyond Gaussian parameterizations. It provides convergence analysis, including local convergence rates and sufficient increase properties for exponential-family distributions.
Model Predictive Path Integral (MPPI) control is a powerful sampling-based method for solving stochastic optimal control problems and has enabled real-time control in complex robotic systems. Despite its empirical success, its theoretical understanding remains limited. In this work, we show that MPPI can be interpreted as a special case of the Expectation-Maximization (EM) algorithm applied to a probabilistic inference formulation of optimal control. This perspective leads to a generalized EM-MPPI framework that extends MPPI beyond the commonly used Gaussian parameterization. We analyze the convergence behavior of this algorithm and characterize the local convergence rate in terms of the covariance of the posterior trajectory distribution and the exploration distribution. For exponential-family distributions, we establish a sufficient increase property of the log-likelihood when the log-partition function is strongly convex. Specializing the analysis to Gaussian MPPI yields explicit global and local convergence characterizations. The code for the experiments will be available upon acceptance.