A novel evolutionary ensemble prediction model using harmony search and stacking for diabetes diagnosis

被引:4
|
作者
Zhang, Zaiheng [1 ]
Lu, Yanjie [2 ]
Ye, Mingtao [2 ]
Huang, Wanyu [1 ]
Jin, Lixu [3 ]
Zhang, Guodao [2 ,4 ,5 ]
Ge, Yisu [6 ]
Baghban, Alireza [7 ]
Zhang, Qiwen [2 ]
Wang, Haiou [3 ]
Zhu, Wenzong [8 ,9 ]
机构
[1] Zhejiang Chinese Med Univ, Sch Clin Med 3, Hangzhou 310053, Peoples R China
[2] Hangzhou Dianzi Univ, Inst Intelligent Media Comp, Hangzhou 310018, Peoples R China
[3] Wenzhou Med Univ, Affiliated Hosp 1, Dept Obstet, Wenzhou 325000, Peoples R China
[4] Tianjin Univ Technol, Minist Educ, Key Lab Comp Vis & Syst, Tianjin 300384, Peoples R China
[5] Ocean Univ China, Coll Engn, Qingdao 266100, Peoples R China
[6] Wenzhou Univ, Zhejiang Key Lab Intelligent Informat Safety & Eme, Wenzhou 325035, Peoples R China
[7] NISOC Co, Proc Engn Dept, Ahvaz, Iran
[8] Wenzhou Hosp Integrated Tradit Chinese & Western M, Wenzhou 325000, Peoples R China
[9] Zhejiang Chinese Med Univ, Wenzhou TCM Hosp, Wenzhou Hosp Tradit Chinese Med, Wenzhou 325000, Peoples R China
关键词
Diabetes diagnosis; Ensemble learning; Stacking; Feature selection; Harmony search; Combination optimization; MACHINE LEARNING ALGORITHMS; ANT COLONY OPTIMIZATION; FUTURE-DEVELOPMENT; TYPE-2; OBESITY; RISK; COMPLICATIONS; ASSOCIATION; POPULATION; FITNESS;
D O I
10.1016/j.jksuci.2023.101873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Diabetes is a dreaded disease that can be identified by elevated blood glucose levels in the blood, and undiagnosed diabetes can cause a host of related complications, such as retinopathy and nephropathy. In terms of type, the main categories are type 1 diabetes (T1DM), type 2 diabetes (T2DM) and gestational diabetes mellitus (GDM). Machine learning models and metaheuristic optimization algorithms can play an important role in the early detection, diagnosis and treatment of this disease. To this end, we propose AHDHSStacking, an ensemble learning framework for diabetes mellitus classification and diagnosis that is based on the harmony search (HS) algorithm and stacking and includes two stages of feature selection and optimization of base-learner combinations. To improve the model's overall performance, the average performance of all base learners is used as the feature selection target, and an adaptive hyperparameter strategy is used to accelerate the iterative process. HS is then used to optimize to find the best combination of base learners, which improves model performance while reducing complexity. Following that, we conducted experiments on the Pima Indians Diabetes (PID) dataset and the Chinese and Western Medicine Diabetes (CWMD) dataset, achieving accuracy of 93.09%, precision of 93.22%, recall of 91.60% , F-measure of 92.25%, and MCC of 84.79% on PID dataset, which is better than all benchmark models and validated the model's validity. CWMD dataset experimental results showed that AHDHS-Stacking screened for key features such as age, gender, urinary glucose, fasting glucose, BMI and cholesterol, and can be used as a practical and accurate method for early diabetes prediction.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Early Diabetes Prediction Based on Stacking Ensemble Learning Model
    Liu, JiMin
    Fan, LuHao
    Jia, QuanQiu
    Wen, LongRi
    Shi, ChengFeng
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2687 - 2692
  • [2] Ensemble Pruning Using Harmony Search
    Sheen, Shina
    Aishwarya, S. V.
    Anitha, R.
    Raghavan, S. V.
    Bhaskar, S. M.
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PT II, 2012, 7209 : 13 - 24
  • [3] Stacking-based multi-objective evolutionary ensemble framework for prediction of diabetes mellitus
    Singh, Namrata
    Singh, Pradeep
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2020, 40 (01) : 1 - 22
  • [4] Diabetes prediction model based on GA-XGBoost and stacking ensemble algorithm
    Li, Wenguang
    Peng, Yan
    Peng, Ke
    PLOS ONE, 2024, 19 (09):
  • [5] Development of novel ensemble model using stacking learning and evolutionary computation techniques for automated hepatocellular carcinoma detection
    Ksiazek, Wojciech
    Hammad, Mohamed
    Plawiak, Pawel
    Acharya, U. Rajendra
    Tadeusiewicz, Ryszard
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2020, 40 (04) : 1512 - 1524
  • [6] A Novel Stacking-Based Deterministic Ensemble Model for Infectious Disease Prediction
    Mahajan, Asmita
    Sharma, Nonita
    Aparicio-Obregon, Silvia
    Alyami, Hashem
    Alharbi, Abdullah
    Anand, Divya
    Sharma, Manish
    Goyal, Nitin
    MATHEMATICS, 2022, 10 (10)
  • [7] A novel stacking technique for prediction of diabetes
    Kalagotla, Satish Kumar
    Gangashetty, Suryakanth, V
    Giridhar, Kanuri
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
  • [8] FS-WOA-stacking: A novel ensemble model for early diagnosis of breast cancer
    Xiao, Tianyun
    Kong, Shanshan
    Zhang, Zichen
    Liu, Fengchun
    Yang, Aimin
    Hua, Dianbo
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 95
  • [9] A Stacking Ensemble Learning Model for Mobile Traffic Prediction
    Li, Zhigang
    Cai, Di
    Wang, Jialin
    Fu, Jingchang
    Qin, Linlin
    Fu, Duomin
    2020 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2020, : 542 - 547
  • [10] Spam comments prediction using stacking with ensemble learning
    Mehmood, Arif
    On, Byung-Won
    Lee, Ingyu
    Ashraf, Imran
    Choi, Gyu Sang
    10TH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, 2018, 933