LG CRNov 3, 2024

Privacy-Preserving Customer Churn Prediction Model in the Context of Telecommunication Industry

Joydeb Kumar Sana, M Sohel Rahman, M Saifur Rahman

arXiv:2411.01447v16.45 citationsh-index: 5

Originality Incremental advance

AI Analysis

This addresses privacy concerns for telecommunications companies using cloud computing, but it is incremental as it builds on existing methods like GANs and aWOE.

The study tackled the problem of preserving privacy in customer churn prediction models for the telecommunications industry by proposing a framework combining Generative Adversarial Networks (GANs) and adaptive Weight-of-Evidence (aWOE), achieving up to 28.9% and 27.9% improvements in accuracy and F-measure, respectively, with an F-measure of 87.1% for a specific model.

Data is the main fuel of a successful machine learning model. A dataset may contain sensitive individual records e.g. personal health records, financial data, industrial information, etc. Training a model using this sensitive data has become a new privacy concern when someone uses third-party cloud computing. Trained models also suffer privacy attacks which leads to the leaking of sensitive information of the training data. This study is conducted to preserve the privacy of training data in the context of customer churn prediction modeling for the telecommunications industry (TCI). In this work, we propose a framework for privacy-preserving customer churn prediction (PPCCP) model in the cloud environment. We have proposed a novel approach which is a combination of Generative Adversarial Networks (GANs) and adaptive Weight-of-Evidence (aWOE). Synthetic data is generated from GANs, and aWOE is applied on the synthetic training dataset before feeding the data to the classification algorithms. Our experiments were carried out using eight different machine learning (ML) classifiers on three openly accessible datasets from the telecommunication sector. We then evaluated the performance using six commonly employed evaluation metrics. In addition to presenting a data privacy analysis, we also performed a statistical significance test. The training and prediction processes achieve data privacy and the prediction classifiers achieve high prediction performance (87.1\% in terms of F-Measure for GANs-aWOE based Naïve Bayes model). In contrast to earlier studies, our suggested approach demonstrates a prediction enhancement of up to 28.9\% and 27.9\% in terms of accuracy and F-measure, respectively.

View on arXiv PDF

Similar