LGAIIRMay 28

LoopFM: Learning frOm HistOrical RePresentations of Foundation Model for Recommendation

arXiv:2605.2928096.4h-index: 8
AI Analysis

For recommendation systems, LoopFM addresses the diminishing transfer ratio in knowledge distillation, enabling more effective use of large foundation models in production.

LoopFM improves knowledge transfer from large foundation models to compact vertical models by using intermediate embeddings as input features, achieving up to 6% AUC improvement on TaobaoAd and doubling transfer ratio over KD in industrial systems with +0.5% to +1.22% conversion gains.

Knowledge distillation (KD) transfers a single scalar prediction from a large foundation model (FM) to compact vertical models (VMs), suffering from diminishing transfer ratio -- the fraction of FM improvement captured by the VM -- as a single scalar cannot convey the rich intermediate knowledge that larger FMs learn. To address this bottleneck, we propose LoopFM (Learning frOm HistOrical ReP*resentations of FM), a framework that opens a high-bandwidth transfer channel by structuring FM intermediate embeddings as input features (e.g., user history sequence) for downstream VMs, without requiring real-time FM inference at serving and architectural coupling between FM and VM. We provide a theoretical framework for LoopFM with a gain decomposition and transfer-ratio analysis. On three public benchmarks, LoopFM demonstrates strong AUC improvements (e.g., 6\%+ on TaobaoAd) and complementary knowledge transfer capability with KD. On industrial-scale systems (billions of examples, trillion-parameter FMs), LoopFM approximately doubles the knowledge transfer ratio on top of KD, delivering a +0.5\% conversion improvement in Y1H1, and a +1.03\% and +1.22\% conversion improvement from two individual launches respectively in Y1H2.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes