Applying machine learning techniques to predict and explain subscriber churn of an online drug information platform

被引:0
|
作者
Georgios Theodoridis
Athanasios Tsadiras
机构
[1] Aristotle University of Thessaloniki,
来源
关键词
Customer churn; Online subscriber churn; Method comparison; Ensemble methods; Neural networks; Advanced preprocessing; Boruta algorithm; Isolation forest; Feature importance;
D O I
暂无
中图分类号
学科分类号
摘要
Presently, most markets are extremely saturated and, as a result, businesses are highly competitive. Hence, avoiding the loss of preexisting customers is pivotal, deeming the prediction of customer loss crucial to efficiently target potential churners and attempt to retain them. This study provides an in-depth comparison of various machine learning techniques and advanced preprocessing methods as well as an overall guide for handling churn prediction problems. Churn prediction is fundamentally a binary classification problem. To handle said problem, within this paper, numerous methods that belong to different machine learning categories (linear, nonlinear, ensemble, neural networks) are constructed, optimized and trained on the subscription data of a new real-world dataset originating from a popular online drug information platform that provides information on drugs and drug substances as well as professional tools for pharmacotherapy decision making. In contrast with previous works that address traditional customer churn in relation to telecom, banking or insurance industries, the current study addresses online subscriber churn where users might churn at any given moment. This study also focuses on the proper preprocessing of the given data via advanced machine learning methods, as well as evaluating the models under different conditions to measure their robustness. The results are presented, compared, analyzed and explained. Extensive feature importance analysis is performed to explain not only the models themselves but to also indicate the main factors that contribute toward churning. The findings co-align with the notion that, under the important condition that the dataset is preprocessed using not only statistical methods but machine learning techniques as well, all methods perform adequately and are generally viable options, but ensemble methods, namely Random Forests, are more flexible and resistant toward outliers. Feature importance analysis indicates that usage, not demographic data, is the prime indicator of churn.
引用
收藏
页码:19501 / 19514
页数:13
相关论文
共 50 条
  • [1] Applying machine learning techniques to predict and explain subscriber churn of an online drug information platform
    Theodoridis, Georgios
    Tsadiras, Athanasios
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22): : 19501 - 19514
  • [2] A Machine Learning Approach to Predict Customer Churn of a Delivery Platform
    Liu, Qing
    Chen, QiuYing
    Lee, Sang-Joon
    [J]. 2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 733 - 735
  • [3] Applying machine learning techniques to predict the properties of energetic materials
    Daniel C. Elton
    Zois Boukouvalas
    Mark S. Butrico
    Mark D. Fuge
    Peter W. Chung
    [J]. Scientific Reports, 8
  • [4] Applying machine learning techniques to predict the properties of energetic materials
    Elton, Daniel C.
    Boukouvalas, Zois
    Butrico, Mark S.
    Fuge, Mark D.
    Chung, Peter W.
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [5] Applying machine learning techniques to predict detonation initiation from hot
    Ryu, Je Ir
    [J]. ENERGY AND AI, 2022, 9
  • [6] Churn Prediction of Employees Using Machine Learning Techniques
    Bandyopadhyay, Nilasha
    Jadhav, Anil
    [J]. TEHNICKI GLASNIK-TECHNICAL JOURNAL, 2021, 15 (01): : 51 - 59
  • [7] A comparison of machine learning techniques for customer churn prediction
    Vafeiadis, T.
    Diamantaras, K. I.
    Sarigiannidis, G.
    Chatzisavvas, K. Ch.
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2015, 55 : 1 - 9
  • [8] Applying Machine Learning Techniques for Religious Extremism Detection on Online User Contents
    Mussiraliyeva, Shynar
    Omarov, Batyrkhan
    Yoo, Paul
    Bolatbek, Milana
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (01): : 915 - 934
  • [9] Using topological data analysis and machine learning to predict customer churn
    Sagming, Marcel
    Heymann, Reolyn
    Visaya, Maria Vivien
    [J]. Journal of Big Data, 2024, 11 (01)
  • [10] Machine Learning Techniques for Predicting Customer Churn in A Credit Card Company
    Chang, Victor
    Gao, Xianghua
    Hall, Karl
    Uchenna, Emmanuel
    [J]. 2022 INTERNATIONAL CONFERENCE ON INDUSTRIAL IOT, BIG DATA AND SUPPLY CHAIN, IIOTBDSC, 2022, : 199 - 207