An ensemble learning approach for diabetes prediction using boosting techniques

被引:11
|
作者
Ganie, Shahid Mohammad [1 ]
Pramanik, Pijush Kanti Dutta [2 ]
Malik, Majid Bashir [3 ]
Mallik, Saurav [4 ]
Qin, Hong [5 ]
机构
[1] Woxsen Univ, AI Res Ctr, Sch Business, Hyderabad, India
[2] Galgotias Univ, Sch Comp Applicat & Technol, Greater Noida, India
[3] Baba Ghulam Shah Badshah Univ, Dept Comp Sci, Rajauri, India
[4] Harvard Univ, Sch Publ Hlth, Dept Environm Hlth, Boston, MA 02138 USA
[5] Univ Tennessee Chattanooga, Coll Engn & Comp Sci, Chattanooga, TN 37403 USA
关键词
diabetes prediction; ensemble learning; XGBoost; CatBoost; LightGBM; AdaBoost; gradient boost;
D O I
10.3389/fgene.2023.1252159
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Introduction: Diabetes is considered one of the leading healthcare concerns affecting millions worldwide. Taking appropriate action at the earliest stages of the disease depends on early diabetes prediction and identification. To support healthcare providers for better diagnosis and prognosis of diseases, machine learning has been explored in the healthcare industry in recent years.Methods: To predict diabetes, this research has conducted experiments on five boosting algorithms on the Pima diabetes dataset. The dataset was obtained from the University of California, Irvine (UCI) machine learning repository, which contains several important clinical features. Exploratory data analysis was used to identify the characteristics of the dataset. Moreover, upsampling, normalisation, feature selection, and hyperparameter tuning were employed for predictive analytics.Results: The results were analysed using various statistical/machine learning metrics and k-fold cross-validation techniques. Gradient boosting achieved the greatest accuracy rate of 92.85% among all the classifiers. Precision, recall, f1-score, and receiver operating characteristic (ROC) curves were used to further validate the model.Discussion: The suggested model outperformed the current studies in terms of prediction accuracy, demonstrating its applicability to other diseases with similar predicate indications.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Diabetes prediction using machine learning and explainable AI techniques
    Tasin, Isfafuzzaman
    Nabil, Tansin Ullah
    Islam, Sanjida
    Khan, Riasat
    HEALTHCARE TECHNOLOGY LETTERS, 2023, 10 (1-2) : 1 - 10
  • [42] Diabetes Prediction Using Ensemble Methods
    Tiwari, Stuti
    Dhanda, Namrata
    AMBIENT INTELLIGENCE IN HEALTH CARE, ICAIHC 2022, 2023, 317 : 405 - 415
  • [43] Predictive Analysis and Prognostic Approach of Diabetes Prediction with Machine Learning Techniques
    J. Omana
    M. Moorthi
    Wireless Personal Communications, 2022, 127 : 465 - 478
  • [44] Predictive Performance of Ensemble Learning Boosting Techniques in Daily Streamflow Simulation
    Chandran, Divya
    Chithra, N. R.
    WATER RESOURCES MANAGEMENT, 2025, 39 (03) : 1235 - 1259
  • [45] Predictive Analysis and Prognostic Approach of Diabetes Prediction with Machine Learning Techniques
    Omana, J.
    Moorthi, M.
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 127 (01) : 465 - 478
  • [46] RG Hyperparameter Optimization Approach for Improved Indirect Prediction of Blood Glucose Levels by Boosting Ensemble Learning
    Wang, Yufei
    Zhang, Haiyang
    An, Yongli
    Ji, Zhanlin
    Ganchev, Ivan
    ELECTRONICS, 2021, 10 (15)
  • [47] Construction of an Ensemble Scheme for Stock Price Prediction Using Deep Learning Techniques
    Appati, Justice Kwame
    Denwar, Ismail Wafaa
    Owusu, Ebenezer
    Soli, Michael Agbo Tettey
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2021, 17 (02) : 72 - 95
  • [48] Field scale wheat yield prediction using ensemble machine learning techniques
    Gawdiya, Sandeep
    Kumar, Dinesh
    Ahmed, Bulbul
    Sharma, Ramandeep Kumar
    Das, Pankaj
    Choudhary, Manoj
    Mattar, Mohamed A.
    SMART AGRICULTURAL TECHNOLOGY, 2024, 9
  • [49] Prediction of spontaneous imbibition in porous media using deep and ensemble learning techniques
    Mahdaviara, Mehdi
    Sharifi, Mohammad
    Bakhshian, Sahar
    Shokri, Nima
    FUEL, 2022, 329
  • [50] Diabetes Prediction Using Enhanced SVM and Deep Neural Network Learning Techniques: An Algorithmic Approach for Early Screening of Diabetes
    Nagaraj, P.
    Deepalakshmi, P.
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2021, 16 (04) : 1 - 20