Implementation of ensemble machine learning classifiers to predict diarrhoea with SMOTEENN, SMOTE, and SMOTETomek class imbalance approaches

被引:0
|
作者
Mbunge, Elliot [1 ]
Millham, Richard C. [2 ]
Sibiya, Maureen Nokuthula [3 ]
Chemhaka, Garikayi [4 ]
Takavarasha, Sam, Jr. [5 ]
Muchemwa, Benhildah [1 ]
Dzinamarira, Tafadzwa [6 ]
机构
[1] Univ Eswatini, Dept Comp Sci, Fac Sci & Engn, Kwaluseni, Manzini, Eswatini
[2] Durban Univ Technol, Dept Informat, Fac Accounting & Informat, ZA-4001 Durban, South Africa
[3] Mangosuthu Univ Technol, Res Innovat & Engagement, 511 Griffiths Mxenge Hwy, ZA-4031 Umlazi, South Africa
[4] Univ Eswatini, Dept Stat & Demog, Fac Social Sci, Kwaluseni Campus, Kwaluseni, Eswatini
[5] Womens Univ Africa, Fac Management & Entrepreneurial Sci, Harare, Zimbabwe
[6] ICAP, Harare, Zimbabwe
来源
2023 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY, ICTAS | 2023年
关键词
Diarrhoea; Ensemble methods; Children; class imbalance; machine learning; Prediction; Zimbabwe;
D O I
10.1109/ICTAS56421.2023.10082744
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diarrhoea continues to be a major public health burden and cause of death among children under 5 years in many developing countries. Rotavirus vaccination, hygiene practices, clean water, and health promotion are among the preventive measures implemented to improve child health. Nevertheless, tackling diarrhoea also requires the integration of ensemble machine learning (ML) into health systems to improve child health. However, the integration of ensemble classifiers into health systems in many developing countries is still nascent. Therefore, this study applied SMOTE, SMOTEEN and SMOTETomek class imbalance approaches and ensemble ML classifiers to predict diarrhoea. Ensemble methods significantly improve the performance of conventional ML classifiers. The study revealed that the ExtraTrees classifier achieved a high recall of 96.3%, accuracy of 94.3%, precision of 93.8%, and F1-score of 95% when predicting diarrhoea with SMOTEENN as compared to SMOTE and SMOTETomek. The performance of the HistGradientBoosting classifier also improved and achieved a high recall of 95.2%, accuracy of 91.5%, precision of 90.4%, and F1score of 92.7%. The paper also shows that ensemble methods are increasingly becoming state-of-the-art solutions for multiple challenges encountered with ML algorithms such as overfitting, computationally intensive, underfitting and representation. The paper also demonstrates how ensemble methods are becoming state-of-the-art solutions to multiple problems that arise with ML algorithms. There is a need to develop data- driven applications that incorporate ensemble methods to model and predict diarrhoea to assist policymakers to craft interventions aimed to improve child health.
引用
收藏
页码:90 / 95
页数:6
相关论文
共 16 条
  • [1] ChemTastesPredictor: An ensemble of machine learning classifiers to predict the taste of molecular tastants
    Rojas, Cristian
    Abril-Gonzalez, Monica
    Ballabio, Davide
    Garcia, Fernando
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2025, 261
  • [2] Machine-Learning Approach to Optimize SMOTE Ratio in Class Imbalance Dataset for Intrusion Detection
    Seo, Jae-Hyun
    Kim, Yong-Hyuk
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
  • [3] A Comparative Study of One-Class Classifiers in Machine Learning Problems with Extreme Class Imbalance
    Sotiropoulos, Dionysios
    Giannoulis, Christos
    Tsihrintzis, George A.
    5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 362 - 364
  • [4] A clustering based ensemble of weighted kernelized extreme learning machine for class imbalance learning
    Choudhary, Roshani
    Shukla, Sanyam
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 164
  • [5] Ensemble Methods with Statistics and Machine Learning on the Class Imbalance Problems of EEG data
    Mishra, Sneha
    Jaiswal, Umesh Chandra
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (05) : 453 - 462
  • [6] Enhancing Medicare Fraud Detection Through Machine Learning: Addressing Class Imbalance With SMOTE-ENN
    Bounab, Rayene
    Zarour, Karim
    Guelib, Bouchra
    Khlifa, Nawres
    IEEE ACCESS, 2024, 12 : 54382 - 54396
  • [7] The Impact of the SMOTE Method on Machine Learning and Ensemble Learning Performance Results in Addressing Class Imbalance in Data Used for Predicting Total Testosterone Deficiency in Type 2 Diabetes Patients
    Kivrak, Mehmet
    Avci, Ugur
    Uzun, Hakki
    Ardic, Cuneyt
    DIAGNOSTICS, 2024, 14 (23)
  • [8] Ensemble of subset online sequential extreme learning machine for class imbalance and concept drift
    Mirza, Bilal
    Lin, Zhiping
    Liu, Nan
    NEUROCOMPUTING, 2015, 149 : 316 - 329
  • [9] Addressing Class Imbalance in Intrusion Detection: A Comprehensive Evaluation of Machine Learning Approaches
    Shanmugam, Vaishnavi
    Razavi-Far, Roozbeh
    Hallaji, Ehsan
    ELECTRONICS, 2025, 14 (01):
  • [10] Ensemble Learning Approaches to Data Imbalance and Competing Objectives in Design of an Industrial Machine Vision System
    Zuvela, Petar
    Lovric, Mario
    Yousefian-Jazi, Ali
    Liu, J. Jay
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2020, 59 (10) : 4636 - 4645