IRAIMar 4, 2025

Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

arXiv:2503.02453v160 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses a critical problem in recommendation systems for large-scale platforms, offering a novel integration approach that is incremental but impactful.

The paper tackles the challenge of integrating generative and dense retrieval methods in recommendation systems by introducing the COBRA framework, which combines sparse semantic IDs and dense vectors through a cascaded process, resulting in substantial improvements in key metrics on a real-world advertising platform with over 200 million daily users.

Generative models have recently gained attention in recommendation systems by directly predicting item identifiers from user interaction sequences. However, existing methods suffer from significant information loss due to the separation of stages such as quantization and sequence modeling, hindering their ability to achieve the modeling precision and accuracy of sequential dense retrieval techniques. Integrating generative and dense retrieval methods remains a critical challenge. To address this, we introduce the Cascaded Organized Bi-Represented generAtive retrieval (COBRA) framework, which innovatively integrates sparse semantic IDs and dense vectors through a cascading process. Our method alternates between generating these representations by first generating sparse IDs, which serve as conditions to aid in the generation of dense vectors. End-to-end training enables dynamic refinement of dense representations, capturing both semantic insights and collaborative signals from user-item interactions. During inference, COBRA employs a coarse-to-fine strategy, starting with sparse ID generation and refining them into dense vectors via the generative model. We further propose BeamFusion, an innovative approach combining beam search with nearest neighbor scores to enhance inference flexibility and recommendation diversity. Extensive experiments on public datasets and offline tests validate our method's robustness. Online A/B tests on a real-world advertising platform with over 200 million daily users demonstrate substantial improvements in key metrics, highlighting COBRA's practical advantages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes