Handling imbalance data in churn prediction using combined SMOTE and RUS with bagging method

被引:0
|
作者
Hartati, Eka Pura [1 ]
Adiwijaya [1 ]
Bijaksana, Moch Arif [1 ]
机构
[1] Telkom Univ, Sch Comp, Bandung, Jawa Barat, Indonesia
关键词
D O I
10.1088/1742-6596/971/1/012007
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Customer churn has become a significant problem and also a challenge for Telecommunication company such as PT. Telkom Indonesia. It is necessary to evaluate whether the big problems of churn customer and the company's managements will make appropriate strategies to minimize the churn and retaining the customer. Churn Customer data which categorized churn Atas Permintaan Sendiri (APS) in this Company is an imbalance data, and this issue is one of the challenging tasks in machine learning. This study will investigate how is handling class imbalance in churn prediction using combined Synthetic Minority Over-Sampling (SMOTE) and Random Under-Sampling (RUS) with Bagging method for a better churn prediction performance's result. The dataset that used is Broadband Internet data which is collected from Telkom Regional 6 Kalimantan. The research firstly using data preprocessing to balance the imbalanced dataset and also to select features by sampling technique SMOIE and RUS, and then building churn prediction model using Bagging methods and C4.5.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Handling class imbalance in customer churn prediction
    Burez, J.
    Van den Poel, D.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 4626 - 4636
  • [2] A Comparison of Two Oversampling Techniques (SMOTE vs MTDF) for Handling Class Imbalance Problem: A Case Study of Customer Churn Prediction
    Amin, Adnan
    Rahim, Faisal
    Ali, Imtiaz
    Khan, Changez
    Anwar, Sajid
    [J]. NEW CONTRIBUTIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, PT 1, 2015, 353 : 215 - 225
  • [3] Handling Imbalanced Data in Customer Churn Prediction Using Combined Sampling and Weighted Random Forest
    Effendy, Veronikha
    Adiwijaya
    Baizal, Z. K. A.
    [J]. 2014 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2014,
  • [4] Performance of RUS and SMOTE Method on Twitter Spam Data Using Random Forest
    Ubaya, Huda
    Juairiah, Ria Siti
    [J]. 3RD FORUM IN RESEARCH, SCIENCE, AND TECHNOLOGY (FIRST 2019) INTERNATIONAL CONFERENCE, 2020, 1500
  • [5] Handling Imbalanced Data in Churn Prediction using ADASYN and Backpropagation Algorithm
    Aditsania, Annisa
    Adiwijaya
    Saonard, Aldo Lionel
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH), 2017, : 533 - 536
  • [6] A Novel Distribution Analysis for SMOTE Oversampling Method in Handling Class Imbalance
    Elreedy, Dina
    Atiya, Amir F.
    [J]. COMPUTATIONAL SCIENCE - ICCS 2019, PT III, 2019, 11538 : 236 - 248
  • [7] A bagging-based selective ensemble model for churn prediction on imbalanced data
    Zhu, Bing
    Qian, Cheng
    vanden Broucke, Seppe
    Xiao, Jin
    Li, Yuanyuan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
  • [8] Adaptive Optimization-Enabled Neural Networks to Handle the Imbalance Churn Data in Churn Prediction
    Garimella, Bharathi
    Prasad, G. V. S. N. R. V.
    Prasad, M. H. M. Krishna
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2021, 20 (04)
  • [9] Failure prediction of Indian Banks using SMOTE, Lasso regression, bagging and boosting
    Shrivastava, Santosh
    Jeyanthi, P. Mary
    Singh, Sarbjit
    [J]. COGENT ECONOMICS & FINANCE, 2020, 8 (01):
  • [10] Churn Prediction using Complaints Data
    Hadden, John
    Tiwari, Ashutosh
    Roy, Rajkumar
    Ruta, Dymtr
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 13, 2006, 13 : 158 - +