IRAIJan 7

Efficient Sequential Recommendation for Long Term User Interest Via Personalization

arXiv:2601.03479v12 citationsh-index: 4Has CodeICDM
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in sequential recommendation systems for real-world applications, offering an incremental improvement to existing models.

The paper tackles the computational inefficiency of transformer-based sequential recommendation models by introducing a method that compresses long user histories into learnable tokens, reducing computational costs while maintaining high recommendation accuracy, as demonstrated in experiments.

Recent years have witnessed success of sequential modeling, generative recommender, and large language model for recommendation. Though the scaling law has been validated for sequential models, it showed inefficiency in computational capacity when considering real-world applications like recommendation, due to the non-linear(quadratic) increasing nature of the transformer model. To improve the efficiency of the sequential model, we introduced a novel approach to sequential recommendation that leverages personalization techniques to enhance efficiency and performance. Our method compresses long user interaction histories into learnable tokens, which are then combined with recent interactions to generate recommendations. This approach significantly reduces computational costs while maintaining high recommendation accuracy. Our method could be applied to existing transformer based recommendation models, e.g., HSTU and HLLM. Extensive experiments on multiple sequential models demonstrate its versatility and effectiveness. Source code is available at \href{https://github.com/facebookresearch/PerSRec}{https://github.com/facebookresearch/PerSRec}.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes