AIJan 8, 2019

Forecasting Granular Audience Size for Online Advertising

arXiv:1901.02412v13 citations
AI Analysis

This work addresses a specific challenge in online advertising for marketers by providing a more efficient and accurate forecasting method, though it is incremental as it builds on existing frequent itemset mining techniques.

The paper tackled the problem of forecasting audience size for online advertising at a granular level by modifying the Eclat frequent itemset mining algorithm to handle categorical variables and using time series analysis with conditional probabilities for forecasting. The result showed that the method lowered computation time for frequent itemset mining on categorical data and outperformed baselines, including neural network models, on hold-out samples.

Orchestration of campaigns for online display advertising requires marketers to forecast audience size at the granularity of specific attributes of web traffic, characterized by the categorical nature of all attributes (e.g. {US, Chrome, Mobile}). With each attribute taking many values, the very large attribute combination set makes estimating audience size for any specific attribute combination challenging. We modify Eclat, a frequent itemset mining (FIM) algorithm, to accommodate categorical variables. For consequent frequent and infrequent itemsets, we then provide forecasts using time series analysis with conditional probabilities to aid approximation. An extensive simulation, based on typical characteristics of audience data, is built to stress test our modified-FIM approach. In two real datasets, comparison with baselines including neural network models, shows that our method lowers computation time of FIM for categorical data. On hold out samples we show that the proposed forecasting method outperforms these baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes