AIJun 2, 2023

PDT: Pretrained Dual Transformers for Time-aware Bipartite Graphs

arXiv:2306.01913v31 citationsh-index: 26
Originality Incremental advance
AI Analysis

This work addresses the challenge of optimizing pre-training for user-content interaction data in recommendation systems, though it appears incremental as it builds on existing pre-training and contrastive learning approaches.

The paper tackled the problem of learning contextual knowledge from user-content interaction data for recommendation tasks by proposing a pre-training method with a dual-Transformer architecture and contrastive learning, resulting in significant performance gains over baselines.

Pre-training on large models is prevalent and emerging with the ever-growing user-generated content in many machine learning application categories. It has been recognized that learning contextual knowledge from the datasets depicting user-content interaction plays a vital role in downstream tasks. Despite several studies attempting to learn contextual knowledge via pre-training methods, finding an optimal training objective and strategy for this type of task remains a challenging problem. In this work, we contend that there are two distinct aspects of contextual knowledge, namely the user-side and the content-side, for datasets where user-content interaction can be represented as a bipartite graph. To learn contextual knowledge, we propose a pre-training method that learns a bi-directional mapping between the spaces of the user-side and the content-side. We formulate the training goal as a contrastive learning task and propose a dual-Transformer architecture to encode the contextual knowledge. We evaluate the proposed method for the recommendation task. The empirical studies have demonstrated that the proposed method outperformed all the baselines with significant gains.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes