Study on risk factors of impaired fasting glucose and development of a prediction model based on Extreme Gradient Boosting algorithm

被引:0
|
作者
Cui, Qiyuan [1 ]
Pu, Jianhong [1 ]
Li, Wei [2 ]
Zheng, Yun [1 ]
Lin, Jiaxi [3 ]
Liu, Lu [3 ]
Xue, Peng [4 ]
Zhu, Jinzhou [3 ]
He, Mingqing [1 ]
机构
[1] Soochow Univ, Dept Geriatr, Affiliated Hosp 1, Suzhou, Jiangsu, Peoples R China
[2] Nanjing Univ, Phys Examinat Ctr, Affiliated Suzhou Hosp, Med Sch, Suzhou, Jiangsu, Peoples R China
[3] Soochow Univ, Dept Gastroenterol, Affiliated Hosp 1, Suzhou, Jiangsu, Peoples R China
[4] Nanjing Univ, Dept Endocrinol, Affiliated Suzhou Hosp, Med Sch, Suzhou, Jiangsu, Peoples R China
来源
关键词
impaired fasting glucose; prediction model; artificial intelligence; cohort study; middle-aged and elderly people; DIAGNOSIS; TOLERANCE;
D O I
10.3389/fendo.2024.1368225
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective The aim of this study was to develop and validate a machine learning-based model to predict the development of impaired fasting glucose (IFG) in middle-aged and older elderly people over a 5-year period using data from a cohort study.Methods This study was a retrospective cohort study. The study population was 1855 participants who underwent consecutive physical examinations at the First Affiliated Hospital of Soochow University between 2018 and 2022.The dataset included medical history, physical examination, and biochemical index test results. The cohort was randomly divided into a training dataset and a validation dataset in a ratio of 8:2. The machine learning algorithms used in this study include Extreme Gradient Boosting (XGBoost), Support Vector Machines (SVM), Naive Bayes, Decision Trees (DT), and traditional Logistic Regression (LR). Feature selection, parameter optimization, and model construction were performed in the training set, while the validation set was used to evaluate the predictive performance of the models. The performance of these models is evaluated by an area under the receiver operating characteristic (ROC) curves (AUC), calibration curves and decision curve analysis (DCA). To interpret the best-performing model, the Shapley Additive exPlanation (SHAP) Plots was used in this study.Results The training/validation dataset consists of 1,855 individuals from the First Affiliated Hospital of Soochow University, yielded significant variables following selection by the Boruta algorithm and logistic multivariate regression analysis. These significant variables included systolic blood pressure (SBP), fatty liver, waist circumference (WC) and serum creatinine (Scr). The XGBoost model outperformed the other models, demonstrating an AUC of 0.7391 in the validation set.Conclusions The XGBoost model was composed of SBP, fatty liver, WC and Scr may assist doctors with the early identification of IFG in middle-aged and elderly people.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] An energy consumption prediction model for electric buses based on extreme gradient boosting fusion algorithm
    Kang, Yiting
    Wei, Jianshu
    Liu, Zhihua
    Xiao, Ke
    INTERNATIONAL JOURNAL OF GREEN ENERGY, 2025,
  • [2] A Self-Care Prediction Model for Children with Disability Based on Genetic Algorithm and Extreme Gradient Boosting
    Syafrudin, Muhammad
    Alfian, Ganjar
    Fitriyani, Norma Latif
    Anshari, Muhammad
    Hadibarata, Tony
    Fatwanto, Agung
    Rhee, Jongtae
    MATHEMATICS, 2020, 8 (09)
  • [3] A Promising Preoperative Prediction Model for Microvascular Invasion in Hepatocellular Carcinoma Based on an Extreme Gradient Boosting Algorithm
    Liu, Weiwei
    Zhang, Lifan
    Xin, Zhaodan
    Zhang, Haili
    You, Liting
    Bai, Ling
    Zhou, Juan
    Ying, Binwu
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [4] Prediction of Cable Failures based on eXtreme Gradient Boosting
    Zhan, Huiyu
    Liu, Keyan
    Jia, Dongli
    2024 6TH ASIA ENERGY AND ELECTRICAL ENGINEERING SYMPOSIUM, AEEES 2024, 2024, : 610 - 614
  • [5] An Extreme Gradient Boosting-based Prediction for Depression
    Ibrahum, Ahmed
    Park, Kwang Ho
    Hong, Jang-Eui
    Van-Huy Pham
    Ryu, Keun Ho
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1607 - 1613
  • [6] Prediction of voltage stability margin in power system based on extreme gradient boosting algorithm
    Wang H.-F.
    Zhang C.-Y.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (03): : 606 - 613
  • [7] Comparative study on prediction of coal seam gas extraction based on Extreme Gradient Boosting and random forest model improved by optimization algorithm
    Li, Ao
    Li, Xijian
    Cai, Junjie
    Chen, Shoukun
    PHYSICS OF FLUIDS, 2025, 37 (03)
  • [8] Psychosocial factors are independent risk factors for the development of Type 2 diabetes in Japanese workers with impaired fasting glucose and/or impaired glucose tolerance
    Toshihiro, M.
    Saito, K.
    Takikawa, S.
    Takebe, N.
    Onoda, T.
    Satoh, J.
    DIABETIC MEDICINE, 2008, 25 (10) : 1211 - 1217
  • [9] Cardiometabolic risk in impaired fasting glucose and impaired glucose tolerance - The atherosclerosis risk in communities study
    Pankow, James S.
    Kwan, David K.
    Duncan, Bruce B.
    Schmidt, Maria I.
    Couper, David J.
    Golden, Sherita
    Ballantyne, Christie M.
    DIABETES CARE, 2007, 30 (02) : 325 - 331
  • [10] A hybrid prediction model for e-commerce customer churn based on logistic regression and extreme gradient boosting algorithm
    Li X.
    Li Z.
    Ingenierie des Systemes d'Information, 2019, 24 (05): : 525 - 530