GTLGMar 16, 2021

Stochastic Bandits for Multi-platform Budget Optimization in Online Advertising

arXiv:2103.10246v243 citations
AI Analysis

This addresses a practical challenge for online advertisers who need to efficiently allocate budgets across platforms, though it is incremental as it builds on prior bandit algorithms.

The paper tackles the problem of optimizing budget allocation across multiple advertising platforms without prior knowledge of ad value, by modeling it as a Stochastic Bandits with Knapsacks problem and extending an existing algorithm to handle discrete and continuous bid spaces. It provides algorithms with improved regret bounds (e.g., O(OPT√(mn/B) + √(mn OPT)) for discrete spaces) and demonstrates performance gains on real-world advertising data.

We study the problem of an online advertising system that wants to optimally spend an advertiser's given budget for a campaign across multiple platforms, without knowing the value for showing an ad to the users on those platforms. We model this challenging practical application as a Stochastic Bandits with Knapsacks problem over $T$ rounds of bidding with the set of arms given by the set of distinct bidding $m$-tuples, where $m$ is the number of platforms. We modify the algorithm proposed in Badanidiyuru \emph{et al.,} to extend it to the case of multiple platforms to obtain an algorithm for both the discrete and continuous bid-spaces. Namely, for discrete bid spaces we give an algorithm with regret $O\left(OPT \sqrt {\frac{mn}{B} }+ \sqrt{mn OPT}\right)$, where $OPT$ is the performance of the optimal algorithm that knows the distributions. For continuous bid spaces the regret of our algorithm is $\tilde{O}\left(m^{1/3} \cdot \min\left\{ B^{2/3}, (m T)^{2/3} \right\} \right)$. When restricted to this special-case, this bound improves over Sankararaman and Slivkins in the regime $OPT \ll T$, as is the case in the particular application at hand. Second, we show an $ Ω\left (\sqrt {m OPT} \right)$ lower bound for the discrete case and an $Ω\left( m^{1/3} B^{2/3}\right)$ lower bound for the continuous setting, almost matching the upper bounds. Finally, we use a real-world data set from a large internet online advertising company with multiple ad platforms and show that our algorithms outperform common benchmarks and satisfy the required properties warranted in the real-world application.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes