IRAICLNov 13, 2017

Targeted Advertising Based on Browsing History

arXiv:1711.04498v18 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge for advertisers and publishers in targeting ads more effectively by inferring demographic information from limited online interactions, but it is incremental as it builds on existing techniques like word embeddings and SVM.

The paper tackles the problem of estimating online audience demographic attributes from browsing histories by proposing a predictive framework that uses website content and word embeddings with TF-IDF weighting. The method improves prediction accuracy over baseline approaches, though specific numbers are not provided.

Audience interest, demography, purchase behavior and other possible classifications are ex- tremely important factors to be carefully studied in a targeting campaign. This information can help advertisers and publishers deliver advertisements to the right audience group. How- ever, it is not easy to collect such information, especially for the online audience with whom we have limited interaction and minimum deterministic knowledge. In this paper, we pro- pose a predictive framework that can estimate online audience demographic attributes based on their browsing histories. Under the proposed framework, first, we retrieve the content of the websites visited by audience, and represent the content as website feature vectors; second, we aggregate the vectors of websites that audience have visited and arrive at feature vectors representing the users; finally, the support vector machine is exploited to predict the audience demographic attributes. The key to achieving good prediction performance is preparing representative features of the audience. Word Embedding, a widely used tech- nique in natural language processing tasks, together with term frequency-inverse document frequency weighting scheme is used in the proposed method. This new representation ap- proach is unsupervised and very easy to implement. The experimental results demonstrate that the new audience feature representation method is more powerful than existing baseline methods, leading to a great improvement in prediction accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes