CLIRDec 17, 2020

BERT Goes Shopping: Comparing Distributional Models for Product Representations

arXiv:2012.09807v2719 citations
AI Analysis

This work provides an incremental improvement in product representation accuracy for e-commerce practitioners, offering guidelines for training under various computational and data constraints.

This paper explores the application of BERT-like architectures to e-commerce product representations, proposing Prod2BERT, which is trained using masked session modeling. Through experiments across multiple shops and tasks, Prod2BERT generally outperforms prod2vec, though performance is sensitive to resources and hyperparameters.

Word embeddings (e.g., word2vec) have been applied successfully to eCommerce products through~\textit{prod2vec}. Inspired by the recent performance improvements on several NLP tasks brought by contextualized embeddings, we propose to transfer BERT-like architectures to eCommerce: our model -- ~\textit{Prod2BERT} -- is trained to generate representations of products through masked session modeling. Through extensive experiments over multiple shops, different tasks, and a range of design choices, we systematically compare the accuracy of~\textit{Prod2BERT} and~\textit{prod2vec} embeddings: while~\textit{Prod2BERT} is found to be superior in several scenarios, we highlight the importance of resources and hyperparameters in the best performing models. Finally, we provide guidelines to practitioners for training embeddings under a variety of computational and data constraints.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes