Learning to Spend: Model Predictive Control for Budgeting under Non-Stationary Returns

Nilavra Pathak, Smriti Shyamal, Prasant Mhasker, Christopher Swartz

arXiv:2604.271861.6

Predicted impact top 81% in SY · last 90 daysOriginality Synthesis-oriented

AI Analysis

For practitioners in digital marketing and resource allocation, this clarifies when predictive control is beneficial versus when simpler reactive policies suffice.

The paper studies budget allocation as a closed-loop control problem, comparing Model Predictive Control (MPC) to reactive policies. It finds MPC outperforms reactive baselines only when return efficiency has predictable structure, offering no advantage under stationary or unpredictable drift.

We study finite-horizon budget allocation as a closed-loop economic control problem and evaluate receding-horizon Model Predictive Control (MPC) relative to reactive budgeting policies. Budgets are allocated periodically under execution noise and operational constraints, while return efficiency may evolve over time. Using a controlled simulation framework motivated by digital marketing, we compare reactive pacing to MPC across environments with increasing degrees of non-stationarity. Our results show that non-stationarity alone does not justify predictive control. When return dynamics are stationary or evolve through unpredictable stochastic drift, MPC offers no systematic advantage over reactive baselines. By contrast, when return efficiency exhibits predictable structure over the planning horizon, that is captured through an underlying model, MPC consistently outperforms reactive budgeting by exploiting intertemporal trade-offs.

View on arXiv PDF

Similar