An empirical study on the class imbalance handling techniques for different diseases

被引:1
|
作者
Rhmann W. [1 ]
机构
[1] School of Computer Application, Lovely Professional University, Punjab, Phagwara
关键词
Cost sensitive neural network; Deep learning; Disease; Genetic algorithm; Statistical test;
D O I
10.1007/s00500-024-09881-y
中图分类号
学科分类号
摘要
Machine learning and deep learning-based techniques are now widely used to identify and diagnose various diseases. However, getting a sufficient data for these machine learning models is difficult, and usually collected data is unbalanced i.e. less number of instances with disease class and a large number of classes without the disease. These imbalanced data cause poor performance of the classifier in the detection of minority or disease classes. To address this class imbalance problem for medical data we have applied 17 different class imbalance handling techniques on four publically available datasets with Random forest as a base classifier. The comprehensive review covering the time frame of 1990 to 2023 is also done on “Class imbalance handling techniques” to gather insights using VOSviewer software. Performances of different class imbalance handling techniques are statistically evaluated and impact of different disease datasets on the prediction performance is also statistically assessed. Two novel techniques Genetic Algorithm-Cost sensitive-Deep neural network(GA-CS-DNN) and Class imbalance handling technique-Genetic Algorithm-Deep neural Network(CIH-GA-DNN)are proposed for handling class imbalance problems. Performances of proposed techniques are compared with other state of art class imbalance handling techniques and obtained results showed that OnesidedSelection outperformed all other techniques. A statistical test further demonstrated that OnesidedSelection performs differently than SMOTENN. Significant statistical differences in illness prediction can be seen between the kidney and diabetes, prostate and kidney, and kidney and heart datasets when compared pair-wise. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.
引用
收藏
页码:11439 / 11456
页数:17
相关论文
共 50 条
  • [21] The Performance Stability of Defect Prediction Models with Class Imbalance: An Empirical Study
    Yu, Qiao
    Jiang, Shujuan
    Zhang, Yanmei
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (02) : 265 - 272
  • [22] An Empirical Study for the Multi-class Imbalance Problem with Neural Networks
    Alejo, R.
    Sotoca, J. M.
    Casan, G. A.
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 479 - +
  • [23] The influence of class imbalance on cost-sensitive learning: An empirical study
    Liu, Xu-Ying
    Zhou, Zhi-Hua
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 970 - +
  • [24] Feedforward neural network models for handling class overlap and class imbalance
    Kretzschmar, R
    Karayiannis, NB
    Eggimann, F
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (05) : 323 - 338
  • [25] Handling Extreme Class Imbalance in Technical Logbook Datasets
    Akhbardeh, Farhad
    Alm, Cecilia Ovesdotter
    Zampieri, Marcos
    Desell, Travis
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4034 - 4045
  • [26] Handling Class Imbalance in Online Transaction Fraud Detection
    Kanika
    Singla, Jimmy
    Bashir, Ali Kashif
    Nam, Yunyoung
    Hasan, Najam U., I
    Tariq, Usman
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02): : 2861 - 2877
  • [27] An Improved Hybrid Approach for Handling Class Imbalance Problem
    Abeer S. Desuky
    Sadiq Hussain
    Arabian Journal for Science and Engineering, 2021, 46 : 3853 - 3864
  • [28] Adjusting and generalizing CBA algorithm to handling class imbalance
    Chen, Wen-Chin
    Hsu, Chiun-Chieh
    Hsu, Jing-Ning
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) : 5907 - 5919
  • [29] An Improved Hybrid Approach for Handling Class Imbalance Problem
    Desuky, Abeer S.
    Hussain, Sadiq
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (04) : 3853 - 3864
  • [30] Resampling Techniques Study on Class Imbalance Problem in Credit Risk Prediction
    Zhao, Zixue
    Cui, Tianxiang
    Ding, Shusheng
    Li, Jiawei
    Bellotti, Anthony Graham
    MATHEMATICS, 2024, 12 (05)