IRLGDec 19, 2025

Warmer for Less: A Cost-Efficient Strategy for Cold-Start Recommendations at Pinterest

arXiv:2512.17277v1h-index: 2
Originality Incremental advance
AI Analysis

This work addresses the cold-start recommendation problem for large-scale platforms like Pinterest, offering an incremental but practical solution tailored to industrial constraints.

The paper tackles the problem of improving recommender system predictions for cold-start items at Pinterest by developing a cost-efficient solution that addresses computational constraints, feature representation, prediction bias, and data sparsity, resulting in a 10% increase in fresh content engagement without negative impact on overall engagement or cost.

Pinterest is a leading visual discovery platform where recommender systems (RecSys) are key to delivering relevant, engaging, and fresh content to our users. In this paper, we study the problem of improving RecSys model predictions for cold-start (CS) items, which appear infrequently in the training data. Although this problem is well-studied in academia, few studies have addressed its root causes effectively at the scale of a platform like Pinterest. By investigating live traffic data, we identified several challenges of the CS problem and developed a corresponding solution for each: First, industrial-scale RecSys models must operate under tight computational constraints. Since CS items are a minority, any related improvements must be highly cost-efficient. To address this, our solutions were designed to be lightweight, collectively increasing the total parameters by only 5%. Second, CS items are represented only by non-historical (e.g., content or attribute) features, which models often treat as less important. To elevate their significance, we introduce a residual connection for the non-historical features. Third, CS items tend to receive lower prediction scores compared to non-CS items, reducing their likelihood of being surfaced. We mitigate this by incorporating a score regularization term into the model. Fourth, the labels associated with CS items are sparse, making it difficult for the model to learn from them. We apply the manifold mixup technique to address this data sparsity. Implemented together, our methods increased fresh content engagement at Pinterest by 10% without negatively impacting overall engagement and cost, and have been deployed to serve over 570 million users on Pinterest.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes