LGIRMLAug 24, 2020

Two Stages Approach for Tweet Engagement Prediction

arXiv:2008.10419v1
Originality Synthesis-oriented
AI Analysis

This work addresses tweet engagement prediction for social media platforms, but it is incremental as it applies existing methods to a specific dataset without major innovations.

The paper tackled the problem of predicting user engagement with tweets by proposing a two-stage approach that combines heterogeneous features from handcrafted features, knowledge graph embeddings, sentiment analysis, and BERT word embeddings, and uses an XGBoost ensemble, achieving a rank of 22 in the RecSys Challenge 2020 leaderboard.

This paper describes the approach proposed by the D2KLab team for the 2020 RecSys Challenge on the task of predicting user engagement facing tweets. This approach relies on two distinct stages. First, relevant features are learned from the challenge dataset. These features are heterogeneous and are the results of different learning modules such as handcrafted features, knowledge graph embeddings, sentiment analysis features and BERT word embeddings. Second, these features are provided in input to an ensemble system based on XGBoost. This approach, only trained on a subset of the entire challenge dataset, ranked 22 in the final leaderboard.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes