Credible Review Detection with Limited Information using Consistency Analysis
This addresses the issue of identifying credible reviews for online platforms, particularly for long-tail items/users where historical data is scarce, though it appears incremental as it builds on prior work with a novel method for a known bottleneck.
The paper tackles the problem of detecting non-credible online reviews (fake, incompetent, or biased) by developing a method that uses consistency analysis with limited information like review texts, ratings, and timestamps, without relying on item/user histories, and shows improvements over state-of-the-art baselines in experiments on real-world datasets.
Online reviews provide viewpoints on the strengths and shortcomings of products/services, influencing potential customers' purchasing decisions. However, the proliferation of non-credible reviews -- either fake (promoting/ demoting an item), incompetent (involving irrelevant aspects), or biased -- entails the problem of identifying credible reviews. Prior works involve classifiers harnessing rich information about items/users -- which might not be readily available in several domains -- that provide only limited interpretability as to why a review is deemed non-credible. This paper presents a novel approach to address the above issues. We utilize latent topic models leveraging review texts, item ratings, and timestamps to derive consistency features without relying on item/user histories, unavailable for "long-tail" items/users. We develop models, for computing review credibility scores to provide interpretable evidence for non-credible reviews, that are also transferable to other domains -- addressing the scarcity of labeled data. Experiments on real-world datasets demonstrate improvements over state-of-the-art baselines.