LGOCFeb 8, 2023

Online Resource Allocation: Bandits feedback and Advice on Time-varying Demands

arXiv:2302.04182v21 citationsh-index: 18
AI Analysis

This addresses resource allocation in dynamic environments like online advertising, offering a robust solution for non-stationary demand, though it is incremental as it builds on advice frameworks.

The paper tackles online resource allocation with non-stationary demand and bandit feedback, showing that without advice, algorithms perform poorly, but with online predictions on demand volumes, their proposed algorithm achieves strong theoretical and numerical results compared to existing methods.

We consider a general online resource allocation model with bandit feedback and time-varying demands. While online resource allocation has been well studied in the literature, most existing works make the strong assumption that the demand arrival process is stationary. In practical applications, such as online advertisement and revenue management, however, this process may be exogenous and non-stationary, like the constantly changing internet traffic. Motivated by the recent Online Algorithms with Advice framework [Mitazenmacher and Vassilvitskii, \emph{Commun. ACM} 2022], we explore how online advice can inform policy design. We establish an impossibility result that any algorithm perform poorly in terms of regret without any advice in our setting. In contrast, we design an robust online algorithm that leverages the online predictions on the total demand volumes. Empowered with online advice, our proposed algorithm is shown to have both theoretical performance and promising numerical results compared with other algorithms in literature. We also provide two explicit examples for the time-varying demand scenarios and derive corresponding theoretical performance guarantees. Finally, we adapt our model to a network revenue management problem, and numerically demonstrate that our algorithm can still performs competitively compared to existing baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes