LG · Feb 14, 2025

Thompson Sampling for Repeated Newsvendor

arXiv:2502.09900v1 · 4 citations · h-index: 1
Originality: Incremental advance
AI Analysis

This work addresses inventory management challenges for businesses by providing a novel online learning approach with interpretable insights, though it is incremental in extending Thompson Sampling to a specific domain.

The paper tackles the repeated newsvendor problem in inventory management by applying Thompson Sampling to handle censored feedback, achieving frequentist regret bounds that are optimal up to logarithmic factors and outperforming existing methods such as online convex optimization in simulations.

In this paper, we investigate the performance of Thompson Sampling (TS) for online learning with censored feedback, focusing primarily on the classic repeated newsvendor model, a foundational framework in inventory management, and demonstrating how our techniques can be naturally extended to a broader class of problems. We model demand using a Weibull distribution and initialize TS with a Gamma prior to dynamically adjust order quantities. Our analysis establishes optimal (up to logarithmic factors) frequentist regret bounds for TS without imposing restrictive prior assumptions. More importantly, it yields novel and highly interpretable insights on how TS addresses the exploration-exploitation trade-off in the repeated newsvendor setting. Specifically, our results show that when past order quantities are sufficiently large to overcome censoring, TS accurately estimates the unknown demand parameters, leading to near-optimal ordering decisions. Conversely, when past orders are relatively small, TS automatically increases future order quantities to gather additional demand information. Extensive numerical simulations further demonstrate that TS outperforms more conservative and widely used approaches such as online convex optimization, upper confidence bounds, and myopic Bayesian dynamic programming. This study also lays the foundation for exploring general online learning problems with censored feedback.
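
The loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions the abstract does not state: the Weibull shape `k` is known, the unknown parameter is a rate `theta` with survival function exp(-theta * x^k), the Gamma prior is placed on `theta` (which makes the update conjugate under censoring), unmet demand is lost so only sales min(demand, order) are observed, and `rho` is the newsvendor critical ratio. The function names and default values are hypothetical.

```python
import numpy as np


def newsvendor_quantile(theta, k, rho):
    """Order quantity y solving F(y) = rho for F(x) = 1 - exp(-theta * x**k)."""
    return (-np.log1p(-rho) / theta) ** (1.0 / k)


def thompson_newsvendor(theta_true, k, rho, horizon,
                        alpha0=1.0, beta0=1.0, seed=0):
    """Run TS with a Gamma(alpha, rate=beta) posterior on the Weibull rate.

    Assumed model (not taken from the paper's text): shape k is known,
    demand D satisfies P(D > x) = exp(-theta_true * x**k), and the
    decision maker observes only sales = min(D, y).
    """
    rng = np.random.default_rng(seed)
    alpha, beta = alpha0, beta0
    orders = np.empty(horizon)
    for t in range(horizon):
        # Sample a plausible rate from the current posterior.
        theta_sample = rng.gamma(shape=alpha, scale=1.0 / beta)
        # Order the critical-ratio quantile under the sampled rate.
        y = newsvendor_quantile(theta_sample, k, rho)
        # Draw true demand: D**k is exponential with rate theta_true.
        demand = (rng.exponential(1.0) / theta_true) ** (1.0 / k)
        sales = min(demand, y)
        # Conjugate update: an uncensored sale adds one to alpha;
        # every period adds sales**k to beta, censored or not.
        if demand < y:
            alpha += 1.0
        beta += sales ** k
        orders[t] = y
    return orders, alpha, beta


if __name__ == "__main__":
    orders, alpha, beta = thompson_newsvendor(
        theta_true=0.5, k=2.0, rho=0.7, horizon=5000, seed=1)
    print("posterior mean of theta:", alpha / beta)
    print("mean of last 500 orders:", orders[-500:].mean())
    print("oracle order quantity:", newsvendor_quantile(0.5, 2.0, 0.7))
```

Under these assumptions the sketch reproduces the mechanism the abstract describes. A small order censors most demand, so `alpha` stays flat while `beta` still grows, which lowers the posterior rate and raises the next sampled order quantity.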

