ML LG CP PR APSep 24, 2025

Error Propagation in Dynamic Programming: From Stochastic Control to Option Pricing

arXiv:2509.20239v14.5h-index: 12

Originality Incremental advance

AI Analysis

It addresses a relatively underexplored aspect of error analysis in stochastic control, which is incremental but relevant for financial applications like option pricing.

This paper tackles the problem of error propagation in stochastic optimal control by analyzing how approximation errors accumulate backward in time, applying the findings to American option pricing with rigorous error control.

This paper investigates theoretical and methodological foundations for stochastic optimal control (SOC) in discrete time. We start formulating the control problem in a general dynamic programming framework, introducing the mathematical structure needed for a detailed convergence analysis. The associate value function is estimated through a sequence of approximations combining nonparametric regression methods and Monte Carlo subsampling. The regression step is performed within reproducing kernel Hilbert spaces (RKHSs), exploiting the classical KRR algorithm, while Monte Carlo sampling methods are introduced to estimate the continuation value. To assess the accuracy of our value function estimator, we propose a natural error decomposition and rigorously control the resulting error terms at each time step. We then analyze how this error propagates backward in time-from maturity to the initial stage-a relatively underexplored aspect of the SOC literature. Finally, we illustrate how our analysis naturally applies to a key financial application: the pricing of American options.

View on arXiv PDF

Similar