IRAINov 30, 2023

Beyond Two-Tower Matching: Learning Sparse Retrievable Cross-Interactions for Recommendation

Stanford
arXiv:2311.18213v123 citationsh-index: 44
Originality Highly original
AI Analysis

This addresses the problem of balancing accuracy and efficiency in large-scale recommendation systems for industrial applications, representing a novel method rather than an incremental improvement.

The paper tackles the limited feature interaction capability and reduced accuracy of two-tower models in recommendation systems by proposing SparCode, a new matching paradigm that improves candidate item matching accuracy while maintaining retrieval efficiency, as demonstrated on open benchmark datasets.

Two-tower models are a prevalent matching framework for recommendation, which have been widely deployed in industrial applications. The success of two-tower matching attributes to its efficiency in retrieval among a large number of items, since the item tower can be precomputed and used for fast Approximate Nearest Neighbor (ANN) search. However, it suffers two main challenges, including limited feature interaction capability and reduced accuracy in online serving. Existing approaches attempt to design novel late interactions instead of dot products, but they still fail to support complex feature interactions or lose retrieval efficiency. To address these challenges, we propose a new matching paradigm named SparCode, which supports not only sophisticated feature interactions but also efficient retrieval. Specifically, SparCode introduces an all-to-all interaction module to model fine-grained query-item interactions. Besides, we design a discrete code-based sparse inverted index jointly trained with the model to achieve effective and efficient model inference. Extensive experiments have been conducted on open benchmark datasets to demonstrate the superiority of our framework. The results show that SparCode significantly improves the accuracy of candidate item matching while retaining the same level of retrieval efficiency with two-tower models. Our source code will be available at MindSpore/models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes