LGJun 20, 2025

SIDE: Semantic ID Embedding for effective learning from sequences

arXiv:2506.16698v13 citationsh-index: 18
Originality Incremental advance
AI Analysis

This addresses storage and inference cost problems for industrial ad-recommendation systems, representing a strong incremental improvement over existing Semantic ID methods.

The paper tackles the challenge of scaling sequence-based recommendation systems with long user histories by proposing a novel approach using vector quantization to create compact Semantic IDs, achieving 2.4X improvement in normalized entropy gain and 3X reduction in data footprint compared to traditional methods.

Sequence-based recommendations models are driving the state-of-the-art for industrial ad-recommendation systems. Such systems typically deal with user histories or sequence lengths ranging in the order of O(10^3) to O(10^4) events. While adding embeddings at this scale is manageable in pre-trained models, incorporating them into real-time prediction models is challenging due to both storage and inference costs. To address this scaling challenge, we propose a novel approach that leverages vector quantization (VQ) to inject a compact Semantic ID (SID) as input to the recommendation models instead of a collection of embeddings. Our method builds on recent works of SIDs by introducing three key innovations: (i) a multi-task VQ-VAE framework, called VQ fusion that fuses multiple content embeddings and categorical predictions into a single Semantic ID; (ii) a parameter-free, highly granular SID-to-embedding conversion technique, called SIDE, that is validated with two content embedding collections, thereby eliminating the need for a large parameterized lookup table; and (iii) a novel quantization method called Discrete-PCA (DPCA) which generalizes and enhances residual quantization techniques. The proposed enhancements when applied to a large-scale industrial ads-recommendation system achieves 2.4X improvement in normalized entropy (NE) gain and 3X reduction in data footprint compared to traditional SID methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes