A proposed hybrid framework to improve the accuracy of customer churn prediction in telecom industry

被引:0
|
作者
Ouf, Shimaa [1 ]
Mahmoud, Kholoud T. [1 ]
Abdel-Fattah, Manal A. [2 ]
机构
[1] Helwan Univ, Fac Commerce & Business Adm, Dept Informat Syst, Cairo, Egypt
[2] Helwan Univ, Fac Comp & Artificial Intelligence, Dept Informat Syst, Cairo, Egypt
关键词
Telecom industry; Churn prediction; Machine learning; XGBOOST classifier; Hybrid framework; Data preprocessing; TELECOMMUNICATION; MODEL;
D O I
10.1186/s40537-024-00922-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the telecom sector, predicting customer churn has increased in importance in recent years. Developing a robust and accurate churn prediction model takes time, but it is crucial. Early churn prediction avoids revenue loss and improves customer retention. Telecom companies must identify these customers before they leave to solve this issue. Researchers have used a variety of applied machine-learning approaches to reveal the hidden relationships between different features. A key aspect of churn prediction is the accuracy level that affects the learning model's performance. This study aims to clarify several aspects of customer churn prediction accuracy and investigate state-of-the-art techniques' performance. However, no previous research has investigated performance using a hybrid framework combining the advantages of selecting suitable data preprocessing, ensemble learning, and resampling techniques. The study introduces a proposed hybrid framework that improves the accuracy of customer churn prediction in the telecom industry. The framework is built by integrating the XGBOOST classifier with the hybrid resampling method SMOTE-ENN, which concerns applying effective techniques for data preprocessing. The proposed framework is used for two experiments with three datasets in the telecom industry. This study determines which features are most crucial and influence customer churn, introduces the impact of data balancing, compares the classifiers' pre- and post-data balancing performances, and examines a speed-accuracy trade-off in hybrid classifiers. Many metrics, including accuracy, precision, recall, F1-score, and ROC curve, are used to analyze the results. All evaluation criteria are used to identify the most effective experiment. The results of the accuracy of the hybrid framework that respects balanced data outperformed applying the classifier only to imbalanced data. In addition, the results of the proposed hybrid framework are compared to previous studies on the same datasets, and the result of this comparison is offered. Compared with the review of the latest works, our proposed hybrid framework with the three datasets outperformed these works.
引用
收藏
页数:27
相关论文
共 50 条
  • [41] An ensemble based approach using a combination of clustering and classification algorithms to enhance customer churn prediction in telecom industry
    Bilal, Syed Fakhar
    Almazroi, Abdulwahab Ali
    Bashir, Saba
    Khan, Farhan Hassan
    Almazroi, Abdulaleem Ali
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [42] An ensemble based approach using a combination of clustering and classification algorithms to enhance customer churn prediction in telecom industry
    Bilal S.F.
    Almazroi A.A.
    Bashir S.
    Khan F.H.
    Almazroi A.A.
    PeerJ Computer Science, 2022, 8
  • [43] Partition cost-sensitive CART based on customer value for Telecom customer churn prediction
    Wang, Chuanqi
    Li, Ruiqi
    Wang, Peng
    Chen, Zonghai
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 5680 - 5684
  • [44] Customer Churn Prediction Based on HMM in Telecommunication Industry
    Zhu, Huisheng
    Yu, Bin
    FUZZY SYSTEMS AND DATA MINING VI, 2020, 331 : 78 - 92
  • [45] Customer churn prediction in telecom using machine learning in big data platform
    Ahmad, Abdelrahim Kasem
    Jafar, Assef
    Aljoumaa, Kadan
    JOURNAL OF BIG DATA, 2019, 6 (01)
  • [46] Customer churn prediction in telecom using machine learning in big data platform
    Abdelrahim Kasem Ahmad
    Assef Jafar
    Kadan Aljoumaa
    Journal of Big Data, 6
  • [47] Telecom Customer Churn Prediction based on Half Termination Dynamic Label and XGBoost
    Zhang, Yi
    Zhang, Fan
    Song, Chuntao
    Cheng, Xinzhou
    Cheng, Chen
    Xu, Lexi
    Xiao, Tian
    Li, Bei
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1563 - 1568
  • [48] An Efficient Hybrid Classifier Model for Customer Churn Prediction
    Anitha, M. A.
    Sherly, K. K.
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2023, 69 (01) : 11 - 18
  • [49] A Hybrid Data Mining Method for Customer Churn Prediction
    Jamalian, Elham
    Foukerdi, Rahim
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2018, 8 (03) : 2991 - 2997
  • [50] Hybrid ensemble learning approaches to customer churn prediction
    Tavassoli, Sara
    Koosha, Hamidreza
    KYBERNETES, 2022, 51 (03) : 1062 - 1088