IRLGNov 11, 2023

Mitigating Pooling Bias in E-commerce Search via False Negative Estimation

arXiv:2311.06444v34 citationsh-index: 9
Originality Incremental advance
AI Analysis

This addresses a specific issue in e-commerce search systems that can enhance user experience and business outcomes, but it is incremental as it builds on existing negative sampling methods.

The paper tackled the problem of pooling bias in e-commerce search relevance assessment caused by false negatives in negative sampling, and introduced BHNS, which improved performance by mitigating this bias, as validated in experiments including on a public dataset.

Efficient and accurate product relevance assessment is critical for user experiences and business success. Training a proficient relevance assessment model requires high-quality query-product pairs, often obtained through negative sampling strategies. Unfortunately, current methods introduce pooling bias by mistakenly sampling false negatives, diminishing performance and business impact. To address this, we present Bias-mitigating Hard Negative Sampling (BHNS), a novel negative sampling strategy tailored to identify and adjust for false negatives, building upon our original False Negative Estimation algorithm. Our experiments in the Instacart search setting confirm BHNS as effective for practical e-commerce use. Furthermore, comparative analyses on public dataset showcase its domain-agnostic potential for diverse applications.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes