A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews
This work addresses the challenge of efficiently processing customer reviews for businesses, though it appears incremental as it integrates and develops existing methods rather than introducing fundamentally new approaches.
The authors tackled the problem of extracting insights from customer reviews by developing a cloud-based machine learning pipeline that combines transformer-based neural networks, vector embeddings, and clustering for topic modeling and keyword extraction. Their system achieved better results than existing solutions for this task, as validated on publicly available datasets.
The efficiency of natural language processing has improved dramatically with the advent of machine learning models, particularly neural network-based solutions. However, some tasks are still challenging, especially when considering specific domains. In this paper, we present a cloud-based system that can extract insights from customer reviews using machine learning methods integrated into a pipeline. For topic modeling, our composite model uses transformer-based neural networks designed for natural language processing, vector embedding-based keyword extraction, and clustering. The elements of our model have been integrated and further developed to meet better the requirements of efficient information extraction, topic modeling of the extracted information, and user needs. Furthermore, our system can achieve better results than this task's existing topic modeling and keyword extraction solutions. Our approach is validated and compared with other state-of-the-art methods using publicly available datasets for benchmarking.