eDiaPredict: An Ensemble-based Framework for Diabetes Prediction

被引:31
|
作者
Singh, Ashima [1 ]
Dhillon, Arwinder [1 ]
Kumar, Neeraj [1 ]
Hossain, M. Shamim [2 ,3 ]
Muhammad, Ghulam [4 ]
Kumar, Manoj [5 ]
机构
[1] Thapar Univ, Comp Sci & Engn Dept, Patiala, Punjab, India
[2] King Saud Univ, Res Chair Pervas & Mobile Comp, Riyadh 11543, Saudi Arabia
[3] King Saud Univ, Dept Software Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
[4] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh, Saudi Arabia
[5] SMVD Univ, Katra, India
关键词
Diabetes prediction; ensembled models; XGBoost; decision tree; random forest;
D O I
10.1145/3415155
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Medical systems incorporate modern computational intelligence in healthcare. Machine learning techniques are applied to predict the onset and reoccurrence of the disease, identify biomarkers for survivability analysis depending upon certain health conditions of the patient. Early prediction of diseases like diabetes is essential as the number of diabetic patients of all age groups is increasing rapidly. To identify underlying reasons for the onset of diabetes in its early stage has become a challenging task for medical practitioners. Continuously increasing diabetic patient data has necessitated for the applications of efficient machine learning algorithms, which learns from the trends of the underlying data and recognizes the critical conditions in patients. In this article, an ensemble-based framework named eDiaPredict is proposed. It uses ensemble modeling, which includes an ensemble of different machine learning algorithms comprising XGBoost, Random Forest, Support Vector Machine, Neural Network, and Decision tree to predict diabetes status among patients. The performance of eDiaPredict has been evaluated using various performance parameters like accuracy, sensitivity, specificity, Gini Index, precision, area under curve, area under convex hull, minimum error rate, and minimum weighted coefficient. The effectiveness of the proposed approach is shown by its application on the PIMA Indian diabetes dataset wherein an accuracy of 95% is achieved.
引用
收藏
页数:26
相关论文
共 50 条
  • [41] Ensemble-Based Machine Learning Algorithm for Loan Default Risk Prediction
    Akinjole, Abisola
    Shobayo, Olamilekan
    Popoola, Jumoke
    Okoyeigbo, Obinna
    Ogunleye, Bayode
    MATHEMATICS, 2024, 12 (21)
  • [42] Ensemble-based deep learning techniques for customer churn prediction model
    Subramanian, R. Siva
    Yamini, B.
    Sudha, Kothandapani
    Sivakumar, S.
    KYBERNETES, 2024,
  • [43] Beating the Uncertainties: Ensemble Forecasting and Ensemble-Based Data Assimilation in Modern Numerical Weather Prediction
    Zhang, Hailing
    Pu, Zhaoxia
    ADVANCES IN METEOROLOGY, 2010, 2010
  • [44] Ensemble-based data assimilation
    Zhang, Fuqing
    Snyder, Chris
    BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 2007, 88 (04) : 565 - 568
  • [45] Ensemble-based sensitivity analysis
    Torn, Ryan D.
    Hakim, Gregory J.
    MONTHLY WEATHER REVIEW, 2008, 136 (02) : 663 - 677
  • [46] Ensemble-based enzyme design
    Jan-Stefan Völler
    Nature Catalysis, 2020, 3 : 774 - 774
  • [47] An Adaptive Ensemble-based Algorithm
    Tenne, Yoel
    APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2017, 1836
  • [48] An Ensemble-Based Scalable Approach for Intrusion Detection Using Big Data Framework
    Sahu, Santosh Kumar
    Mohapatra, Durga Prasad
    Rout, Jitendra Kumar
    Sahoo, Kshira Sagar
    Luhach, Ashish Kr
    BIG DATA, 2021, 9 (04) : 303 - 321
  • [49] An ensemble-based likelihood ratio approach for family-based genomic risk prediction
    An, Hui
    Wei, Chang-shuai
    Wang, Oliver
    Wang, Da-hui
    Xu, Liang-wen
    Lu, Qing
    Ye, Cheng-yin
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE B, 2018, 19 (12): : 935 - 947
  • [50] Elucidating model inadequacies in a cloud parameterization by use of an ensemble-based calibration framework
    Golaz, Jean-Christophe
    Larson, Vincent E.
    Hansen, James A.
    Schanen, David P.
    Griffin, Brian M.
    MONTHLY WEATHER REVIEW, 2007, 135 (12) : 4077 - 4096