Enhancing Customer Churn Prediction With Resampling: A Comparative Study

Authors
Ong, Jia-Xuan [1 ]
Tong, Gee-Kok [1 ]
Khor, Kok-Chin [2 ]
Haw, Su-Cheng [1 ]
Affiliations
[1] Multimedia Univ, Fac Comp & Informat, Persiaran Multimedia, Cyberjaya 63100, Selangor, Malaysia
[2] Univ Tunku Abdul Rahman, Lee Kong Chian Fac Engn & Sci, Jalan Sungai Long, Bandar Sungai Long 43000, Kajang, Malaysia
Keywords
Customer churn prediction; imbalance datasets; resampling; oversampling
DOI
10.18421/TEM133-20
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In a competitive business environment, accurately predicting customer churn is crucial to retaining customers and preventing revenue loss. However, because customer churn data are imbalanced, traditional machine learning algorithms often fail to identify churned customers accurately. This has led researchers to explore resampling techniques, which have proven effective in addressing the issue. Yet current studies in customer churn prediction rarely investigate and compare resampling techniques comprehensively; many rely predominantly on a single method, such as SMOTE. Hence, this paper compares and evaluates the effectiveness of several resampling methods, including oversampling, undersampling, and hybrid techniques. We used the benchmark telecommunication customer churn dataset from IBM Watson, in which approximately 26.5% of the customers have churned, indicating that the data are imbalanced. Our results demonstrate that random forest combined with a hybrid sampling method, SMOTE-ENN, obtained the best result. The combination yields an F1 score of 95.3% and an accuracy of 96.0%, surpassing previous studies that used the same dataset. This highlights the benefits of comparing resampling techniques when predicting customer churn, specifically on imbalanced datasets.
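To illustrate why the abstract reports an F1 score alongside accuracy, the following minimal sketch builds a synthetic label set with the same 26.5% churn rate and scores a trivial "nobody churns" baseline. It then balances the classes with plain random oversampling, which serves here only as a simple stand-in for the oversampling family; it is not the SMOTE-ENN pipeline evaluated in the paper. The function names, the 1,000-customer sample, and the placeholder feature rows are all assumptions made for illustration.

```python
import random
from collections import Counter


def accuracy_and_f1(y_true, y_pred, positive=1):
    """Return (accuracy, F1 for the positive class) from two label lists."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return accuracy, f1


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority rows until every class matches
    the size of the largest class (hypothetical helper, not from the paper)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, l in zip(rows, labels) if l == cls]
        for _ in range(target - n):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


# Synthetic data (assumption): 1,000 customers, 265 of them churned (26.5%).
labels = [1] * 265 + [0] * 735
rows = [[i] for i in range(len(labels))]  # placeholder one-feature rows

# A baseline that predicts "no churn" for everyone looks accurate
# but never identifies a single churned customer.
baseline_pred = [0] * len(labels)
baseline_accuracy, baseline_f1 = accuracy_and_f1(labels, baseline_pred)

# Random oversampling equalizes the class counts.
balanced_rows, balanced_labels = random_oversample(rows, labels)
balanced_counts = Counter(balanced_labels)

print(f"baseline accuracy={baseline_accuracy:.3f}, F1={baseline_f1:.3f}")
print(f"class counts after oversampling: {dict(balanced_counts)}")
```

In a real experiment, resampling would normally be applied to the training split only, so that the test set keeps the original class distribution.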
Pages: 1927-1936
Page count: 10