LG EM MLSep 28, 2020

Learning Classifiers under Delayed Feedback with a Time Window Assumption

arXiv:2009.13092v24.28 citations

Originality Incremental advance

AI Analysis

This work addresses delayed feedback in applications like online ads, offering an incremental improvement over existing methods for more accurate conversion prediction.

The paper tackles the problem of training binary classifiers under delayed feedback, where labels change over time, by proposing a method with an unbiased and convex empirical risk using all samples under a time window assumption. Experimental results on synthetic and real online advertising datasets demonstrate its effectiveness, though specific performance numbers are not provided.

We consider training a binary classifier under delayed feedback (\emph{DF learning}). For example, in the conversion prediction in online ads, we initially receive negative samples that clicked the ads but did not buy an item; subsequently, some samples among them buy an item then change to positive. In the setting of DF learning, we observe samples over time, then learn a classifier at some point. We initially receive negative samples; subsequently, some samples among them change to positive. This problem is conceivable in various real-world applications such as online advertisements, where the user action takes place long after the first click. Owing to the delayed feedback, naive classification of the positive and negative samples returns a biased classifier. One solution is to use samples that have been observed for more than a certain time window assuming these samples are correctly labeled. However, existing studies reported that simply using a subset of all samples based on the time window assumption does not perform well, and that using all samples along with the time window assumption improves empirical performance. We extend these existing studies and propose a method with the unbiased and convex empirical risk that is constructed from all samples under the time window assumption. To demonstrate the soundness of the proposed method, we provide experimental results on a synthetic and open dataset that is the real traffic log datasets in online advertising.

View on arXiv PDF

Similar