A novel evolutionary ensemble prediction model using harmony search and stacking for diabetes diagnosis

被引:4
|
作者
Zhang, Zaiheng [1 ]
Lu, Yanjie [2 ]
Ye, Mingtao [2 ]
Huang, Wanyu [1 ]
Jin, Lixu [3 ]
Zhang, Guodao [2 ,4 ,5 ]
Ge, Yisu [6 ]
Baghban, Alireza [7 ]
Zhang, Qiwen [2 ]
Wang, Haiou [3 ]
Zhu, Wenzong [8 ,9 ]
机构
[1] Zhejiang Chinese Med Univ, Sch Clin Med 3, Hangzhou 310053, Peoples R China
[2] Hangzhou Dianzi Univ, Inst Intelligent Media Comp, Hangzhou 310018, Peoples R China
[3] Wenzhou Med Univ, Affiliated Hosp 1, Dept Obstet, Wenzhou 325000, Peoples R China
[4] Tianjin Univ Technol, Minist Educ, Key Lab Comp Vis & Syst, Tianjin 300384, Peoples R China
[5] Ocean Univ China, Coll Engn, Qingdao 266100, Peoples R China
[6] Wenzhou Univ, Zhejiang Key Lab Intelligent Informat Safety & Eme, Wenzhou 325035, Peoples R China
[7] NISOC Co, Proc Engn Dept, Ahvaz, Iran
[8] Wenzhou Hosp Integrated Tradit Chinese & Western M, Wenzhou 325000, Peoples R China
[9] Zhejiang Chinese Med Univ, Wenzhou TCM Hosp, Wenzhou Hosp Tradit Chinese Med, Wenzhou 325000, Peoples R China
关键词
Diabetes diagnosis; Ensemble learning; Stacking; Feature selection; Harmony search; Combination optimization; MACHINE LEARNING ALGORITHMS; ANT COLONY OPTIMIZATION; FUTURE-DEVELOPMENT; TYPE-2; OBESITY; RISK; COMPLICATIONS; ASSOCIATION; POPULATION; FITNESS;
D O I
10.1016/j.jksuci.2023.101873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Diabetes is a dreaded disease that can be identified by elevated blood glucose levels in the blood, and undiagnosed diabetes can cause a host of related complications, such as retinopathy and nephropathy. In terms of type, the main categories are type 1 diabetes (T1DM), type 2 diabetes (T2DM) and gestational diabetes mellitus (GDM). Machine learning models and metaheuristic optimization algorithms can play an important role in the early detection, diagnosis and treatment of this disease. To this end, we propose AHDHSStacking, an ensemble learning framework for diabetes mellitus classification and diagnosis that is based on the harmony search (HS) algorithm and stacking and includes two stages of feature selection and optimization of base-learner combinations. To improve the model's overall performance, the average performance of all base learners is used as the feature selection target, and an adaptive hyperparameter strategy is used to accelerate the iterative process. HS is then used to optimize to find the best combination of base learners, which improves model performance while reducing complexity. Following that, we conducted experiments on the Pima Indians Diabetes (PID) dataset and the Chinese and Western Medicine Diabetes (CWMD) dataset, achieving accuracy of 93.09%, precision of 93.22%, recall of 91.60% , F-measure of 92.25%, and MCC of 84.79% on PID dataset, which is better than all benchmark models and validated the model's validity. CWMD dataset experimental results showed that AHDHS-Stacking screened for key features such as age, gender, urinary glucose, fasting glucose, BMI and cholesterol, and can be used as a practical and accurate method for early diabetes prediction.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Effective And Optimized software Reliability Prediction Using Harmony Search Algorithm
    Altaf, Insha
    Majeed, Insha
    Iqbal, Khan Arshid
    PROCEEDINGS OF 2016 ONLINE INTERNATIONAL CONFERENCE ON GREEN ENGINEERING AND TECHNOLOGIES (IC-GET), 2016,
  • [42] SSEM: A Novel Self-Adaptive Stacking Ensemble Model for Classification
    Jiang, Weili
    Chen, Zhenhua
    Xiang, Yan
    Shao, Dangguo
    Ma, Lei
    Zhang, Junpeng
    IEEE ACCESS, 2019, 7 : 120337 - 120349
  • [43] A Novel Ensemble Model for Brain Tumor Diagnosis
    Talaat, Amira Samy
    NEW MATHEMATICS AND NATURAL COMPUTATION, 2025, 21 (01) : 195 - 211
  • [44] Phishing Websites Detection by Using Optimized Stacking Ensemble Model
    Al-Mekhlafi, Zeyad Ghaleb
    Mohammed, Badiea Abdulkarem
    Al-Sarem, Mohammed
    Saeed, Faisal
    Al-Hadhrami, Tawfik
    Alshammari, Mohammad T.
    Alreshidi, Abdulrahman
    Alshammari, Talal Sarheed
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2022, 41 (01): : 109 - 125
  • [45] Stacking sequence optimization for maximum buckling load of composite plates using harmony search algorithm
    de Almeida, Felipe Schaedler
    COMPOSITE STRUCTURES, 2016, 143 : 287 - 299
  • [46] StackDPP: a stacking ensemble based DNA-binding protein prediction model
    Ahmed, Sheikh Hasib
    Bose, Dibyendu Brinto
    Khandoker, Rafi
    Rahman, M. Saifur
    BMC BIOINFORMATICS, 2024, 25 (01)
  • [47] A stacking ensemble model based on nonlinear feature selection for photovoltaic power prediction
    Tang, Xin
    Zhang, Haiqing
    Li, Daiwei
    Tang, Dan
    Gong, Cheng
    Yu, Xi
    2024 7TH ASIA CONFERENCE ON ENERGY AND ELECTRICAL ENGINEERING, ACEEE 2024, 2024, : 345 - 349
  • [48] MMSE: A Multi-Model Stacking Ensemble Learning Algorithm for Purchase Prediction
    Zhou, Aolong
    Ren, Kaijun
    Li, Xiaoyong
    Zhang, Wen
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 96 - 102
  • [49] Prediction Method for Ocean Wave Height Based on Stacking Ensemble Learning Model
    Zhan, Yu
    Zhang, Huajun
    Li, Jianhao
    Li, Gen
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (08)
  • [50] StackDPP: a stacking ensemble based DNA-binding protein prediction model
    Sheikh Hasib Ahmed
    Dibyendu Brinto Bose
    Rafi Khandoker
    M Saifur Rahman
    BMC Bioinformatics, 25