Context-Aware Learning to Rank with Self-Attention
The paper addresses a limitation of ranking systems for e-commerce search: item interactions are ignored during inference. The contribution is incremental, since it builds on existing self-attention methods.
The paper tackles the problem of learning to rank in e-commerce search by proposing a context-aware neural network model that uses self-attention to consider item interactions during both training and inference, resulting in significant performance gains over baselines and new state-of-the-art results on the MSLR-WEB30K benchmark.
Learning to rank is a key component of many e-commerce search engines. In learning to rank, one is interested in optimising the global ordering of a list of items according to their utility for users. Popular approaches learn a scoring function that scores items individually (i.e. without the context of other items in the list) by optimising a pointwise, pairwise or listwise loss. The list is then sorted in the descending order of the scores. Possible interactions between items present in the same list are taken into account in the training phase at the loss level. However, during inference, items are scored individually, and possible interactions between them are not considered. In this paper, we propose a context-aware neural network model that learns item scores by applying a self-attention mechanism. The relevance of a given item is thus determined in the context of all other items present in the list, both in training and in inference. We empirically demonstrate significant performance gains of self-attention based neural architecture over Multi-Layer Perceptron baselines, in particular on a dataset coming from search logs of a large scale e-commerce marketplace, Allegro.pl. This effect is consistent across popular pointwise, pairwise and listwise losses. Finally, we report new state-of-the-art results on MSLR-WEB30K, the learning to rank benchmark.
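The contrast between individual and context-aware scoring can be illustrated with a small NumPy sketch. It is not the architecture from the paper: it assumes a single attention head, a residual connection and a linear scoring head, with random weights, and all names (pointwise_scores, self_attention_scores, Wq, Wk, Wv, w) are hypothetical. It only shows that a pointwise scorer gives an item the same score regardless of its neighbours, while a self-attention scorer does not.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def pointwise_scores(items, w):
    # Each item is scored from its own features only (no list context).
    return items @ w

def self_attention_scores(items, Wq, Wk, Wv, w):
    # Assumed simplification: one head, residual connection, linear head.
    q, k, v = items @ Wq, items @ Wk, items @ Wv
    attn = softmax(q @ k.T / np.sqrt(k.shape[-1]))  # (n_items, n_items)
    context = attn @ v                              # mix in the other items
    return (items + context) @ w

rng = np.random.default_rng(0)
d = 4                                               # feature dimension
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))
w = rng.normal(size=d)

# Two lists that share the first item but differ in every other item.
list_a = rng.normal(size=(5, d))
list_b = list_a.copy()
list_b[1:] = rng.normal(size=(4, d))

pw_a = pointwise_scores(list_a, w)[0]
pw_b = pointwise_scores(list_b, w)[0]
ctx_a = self_attention_scores(list_a, Wq, Wk, Wv, w)[0]
ctx_b = self_attention_scores(list_b, Wq, Wk, Wv, w)[0]

# Final ranking: sort items in descending order of their scores.
scores_a = self_attention_scores(list_a, Wq, Wk, Wv, w)
ranking = np.argsort(-scores_a)

# Without positional encodings, shuffling the input list only shuffles
# the scores in the same way (permutation equivariance).
perm = rng.permutation(5)
scores_perm = self_attention_scores(list_a[perm], Wq, Wk, Wv, w)
```

Here pw_a and pw_b are equal because the shared item is scored in isolation, whereas ctx_a and ctx_b differ because the attention weights depend on the rest of the list. The same forward pass is used in training and inference, which is the property the abstract emphasises.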