CLIRLGDec 12, 2020

Yelp Review Rating Prediction: Machine Learning and Deep Learning Models

arXiv:2012.06690v116 citations
AI Analysis

This paper addresses the problem of predicting restaurant ratings from text reviews for Yelp users, providing an incremental improvement in classification accuracy.

This paper predicts Yelp restaurant ratings from review text using machine learning and deep learning models. XLNet achieved 70% accuracy for 5-star classification, outperforming Logistic Regression which reached 64% accuracy.

We predict restaurant ratings from Yelp reviews based on Yelp Open Dataset. Data distribution is presented, and one balanced training dataset is built. Two vectorizers are experimented for feature engineering. Four machine learning models including Naive Bayes, Logistic Regression, Random Forest, and Linear Support Vector Machine are implemented. Four transformer-based models containing BERT, DistilBERT, RoBERTa, and XLNet are also applied. Accuracy, weighted F1 score, and confusion matrix are used for model evaluation. XLNet achieves 70% accuracy for 5-star classification compared with Logistic Regression with 64% accuracy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes