IRAIMar 7

Efficient Personalized Reranking with Semi-Autoregressive Generation and Online Knowledge Distillation

arXiv:2603.07107v13 citations
Predicted impact top 39% in IR · last 90 daysOriginality Highly original
AI Analysis

This work provides a more efficient and effective personalized reranking solution for recommender system developers, improving the balance between recommendation quality and real-time inference.

This paper addresses the challenges of high generation quality and low-latency inference in generative reranking models for recommender systems. The authors propose a Personalized Semi-Autoregressive with online knowledge Distillation (PSAD) framework that significantly outperforms state-of-the-art baselines in both ranking performance and inference efficiency on three large-scale public datasets.

Generative models offer a promising paradigm for the final stage reranking in multi-stage recommender systems, with the ability to capture inter-item dependencies within reranked lists. However, their practical deployment still faces two key challenges: (1) an inherent conflict between achieving high generation quality and ensuring low-latency inference, making it difficult to balance the two, and (2) insufficient interaction between user and item features in existing methods. To address these challenges, we propose a novel Personalized Semi-Autoregressive with online knowledge Distillation (PSAD) framework for reranking. In this framework, the teacher model adopts a semi-autoregressive generator to balance generation quality and efficiency, while its ranking knowledge is distilled online into a lightweight scoring network during joint training, enabling real-time and efficient inference. Furthermore, we propose a User Profile Network (UPN) that injects user intent and models interest dynamics, enabling deeper interactions between users and items. Extensive experiments conducted on three large-scale public datasets demonstrate that PSAD significantly outperforms state-of-the-art baselines in both ranking performance and inference efficiency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes