Research on customer churn prediction and model interpretability analysis

被引:4
|
作者
Peng, Ke [1 ]
Peng, Yan [1 ]
Li, Wenguang [1 ]
机构
[1] Sichuan Univ Sci & Engn, Coll Comp Sci & Engn, Yibin, Peoples R China
来源
PLOS ONE | 2023年 / 18卷 / 12期
关键词
D O I
10.1371/journal.pone.0289724
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, with the continuous improvement of the financial system and the rapid development of the banking industry, the competition of the banking industry itself has intensified. At the same time, with the rapid development of information technology and Internet technology, customers' choice of financial products is becoming more and more diversified, and customers' dependence and loyalty to banking institutions is becoming less and less, and the problem of customer churn in commercial banks is becoming more and more prominent. How to predict customer behavior and retain existing customers has become a major challenge for banks to solve. Therefore, this study takes a bank's business data on Kaggle platform as the research object, uses multiple sampling methods to compare the data for balancing, constructs a bank customer churn prediction model for churn identification by GA-XGBoost, and conducts interpretability analysis on the GA-XGBoost model to provide decision support and suggestions for the banking industry to prevent customer churn. The results show that: (1) The applied SMOTEENN is more effective than SMOTE and ADASYN in dealing with the imbalance of banking data. (2) The F1 and AUC values of the model improved and optimized by XGBoost using genetic algorithm can reach 90% and 99%, respectively, which are optimal compared to other six machine learning models. The GA-XGBoost classifier was identified as the best solution for the customer churn problem. (3) Using Shapley values, we explain how each feature affects the model results, and analyze the features that have a high impact on the model prediction, such as the total number of transactions in the past year, the amount of transactions in the past year, the number of products owned by customers, and the total sales balance. The contribution of this paper is mainly in two aspects: (1) this study can provide useful information from the black box model based on the accurate identification of churned customers, which can provide reference for commercial banks to improve their service quality and retain customers; (2) it can provide reference for customer churn early warning models of other related industries, which can help the banking industry to maintain customer stability, maintain market position and reduce corporate losses.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Hybrid black-box classification for customer churn prediction with segmented interpretability analysis
    De Caigny, Arno
    De Bock, Koen W.
    Verboven, Sam
    DECISION SUPPORT SYSTEMS, 2024, 181
  • [2] TreeLogit model for customer churn prediction
    Qi, Jiayin
    Zhang, Yangming
    Zhang, Yingying
    Shi, Shuang
    APSCC: 2006 IEEE ASIA-PACIFIC CONFERENCE ON SERVICES COMPUTING, PROCEEDINGS, 2006, : 70 - +
  • [3] ADTreesLogit model for customer churn prediction
    Qi, Jiayin
    Zhang, Li
    Liu, Yanping
    Li, Ling
    Zhou, Yongpin
    Shen, Yao
    Liang, Liang
    Li, Huaizu
    ANNALS OF OPERATIONS RESEARCH, 2009, 168 (01) : 247 - 265
  • [4] ADTreesLogit model for customer churn prediction
    Jiayin Qi
    Li Zhang
    Yanping Liu
    Ling Li
    Yongpin Zhou
    Yao Shen
    Liang Liang
    Huaizu Li
    Annals of Operations Research, 2009, 168
  • [5] Customer churn prediction by hybrid model
    Lee, Jae Sik
    Lee, Jin Chun
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 959 - 966
  • [6] Research on a Predictive Model of the Customer Churn
    Fang Xu
    2011 7TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING (WICOM), 2011,
  • [7] Social network analysis for customer churn prediction
    Verbeke, Wouter
    Martens, David
    Baesens, Bart
    APPLIED SOFT COMPUTING, 2014, 14 : 431 - 446
  • [8] An analysis on classification models for customer churn prediction
    Mouli, Kathi Chandra
    Raghavendran, Ch. V.
    Bharadwaj, V. Y.
    Vybhavi, G. Y.
    Sravani, C.
    Vafaeva, Khristina Maksudovna
    Deorari, Rajesh
    Hussein, Laith
    COGENT ENGINEERING, 2024, 11 (01):
  • [9] Telecom customer churn prediction model : Analysis of machine learning techniques for churn prediction and factor identification in telecom sector
    Pareek, Anshul
    Poonam
    Arora, Shaifali Madan
    Gupta, Nidhi
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2024, 45 (02): : 613 - 630
  • [10] Enhancing Customer Churn Prediction in the Banking Sector through Hybrid Segmented Models with Model-Agnostic Interpretability Techniques
    Vashistha, Astha
    Tiwari, Anoop Kumar
    Ghai, Shubhdeep Singh
    Yadav, Paritosh Kumar
    Pandey, Sudhakar
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2024,