Combat AI With AI: Counteract Machine-Generated Fake Restaurant Reviews on Social Media
This addresses the challenge for social media platforms in combating AI-generated fake reviews, though it is incremental as it builds on existing detection methods.
The paper tackles the problem of detecting machine-generated fake restaurant reviews on social media by fine-tuning a GPT output detector using high-quality elite reviews from Yelp, achieving significant outperformance over existing solutions. It also analyzes patterns in non-elite reviews across dimensions like review characteristics and writing style.
Recent advances in generative models such as GPT may be used to fabricate indistinguishable fake customer reviews at a much lower cost, thus posing challenges for social media platforms to detect these machine-generated fake reviews. We propose to leverage the high-quality elite restaurant reviews verified by Yelp to generate fake reviews from the OpenAI GPT review creator and ultimately fine-tune a GPT output detector to predict fake reviews that significantly outperform existing solutions. We further apply the model to predict non-elite reviews and identify the patterns across several dimensions, such as review, user and restaurant characteristics, and writing style. We show that social media platforms are continuously challenged by machine-generated fake reviews, although they may implement detection systems to filter out suspicious reviews.