MELGAPFeb 5, 2024

Improved prediction of future user activity in online A/B testing

arXiv:2402.03231v1h-index: 5
Originality Incremental advance
AI Analysis

This work addresses the need for accurate predictions to optimize experiment duration and treatment effect estimates in online A/B testing, representing an incremental improvement over existing methods.

The paper tackles the problem of predicting participant inclusion rates in online A/B tests by introducing a Bayesian nonparametric approach that forecasts both the number of new customers and their observation frequencies, demonstrating superior performance on real and simulated data.

In online randomized experiments or A/B tests, accurate predictions of participant inclusion rates are of paramount importance. These predictions not only guide experimenters in optimizing the experiment's duration but also enhance the precision of treatment effect estimates. In this paper we present a novel, straightforward, and scalable Bayesian nonparametric approach for predicting the rate at which individuals will be exposed to interventions within the realm of online A/B testing. Our approach stands out by offering dual prediction capabilities: it forecasts both the quantity of new customers expected in future time windows and, unlike available alternative methods, the number of times they will be observed. We derive closed-form expressions for the posterior distributions of the quantities needed to form predictions about future user activity, thereby bypassing the need for numerical algorithms such as Markov chain Monte Carlo. After a comprehensive exposition of our model, we test its performance on experiments on real and simulated data, where we show its superior performance with respect to existing alternatives in the literature.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes