Study on risk factors of impaired fasting glucose and development of a prediction model based on Extreme Gradient Boosting algorithm

被引:0
|
作者
Cui, Qiyuan [1 ]
Pu, Jianhong [1 ]
Li, Wei [2 ]
Zheng, Yun [1 ]
Lin, Jiaxi [3 ]
Liu, Lu [3 ]
Xue, Peng [4 ]
Zhu, Jinzhou [3 ]
He, Mingqing [1 ]
机构
[1] Soochow Univ, Dept Geriatr, Affiliated Hosp 1, Suzhou, Jiangsu, Peoples R China
[2] Nanjing Univ, Phys Examinat Ctr, Affiliated Suzhou Hosp, Med Sch, Suzhou, Jiangsu, Peoples R China
[3] Soochow Univ, Dept Gastroenterol, Affiliated Hosp 1, Suzhou, Jiangsu, Peoples R China
[4] Nanjing Univ, Dept Endocrinol, Affiliated Suzhou Hosp, Med Sch, Suzhou, Jiangsu, Peoples R China
来源
关键词
impaired fasting glucose; prediction model; artificial intelligence; cohort study; middle-aged and elderly people; DIAGNOSIS; TOLERANCE;
D O I
10.3389/fendo.2024.1368225
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective The aim of this study was to develop and validate a machine learning-based model to predict the development of impaired fasting glucose (IFG) in middle-aged and older elderly people over a 5-year period using data from a cohort study.Methods This study was a retrospective cohort study. The study population was 1855 participants who underwent consecutive physical examinations at the First Affiliated Hospital of Soochow University between 2018 and 2022.The dataset included medical history, physical examination, and biochemical index test results. The cohort was randomly divided into a training dataset and a validation dataset in a ratio of 8:2. The machine learning algorithms used in this study include Extreme Gradient Boosting (XGBoost), Support Vector Machines (SVM), Naive Bayes, Decision Trees (DT), and traditional Logistic Regression (LR). Feature selection, parameter optimization, and model construction were performed in the training set, while the validation set was used to evaluate the predictive performance of the models. The performance of these models is evaluated by an area under the receiver operating characteristic (ROC) curves (AUC), calibration curves and decision curve analysis (DCA). To interpret the best-performing model, the Shapley Additive exPlanation (SHAP) Plots was used in this study.Results The training/validation dataset consists of 1,855 individuals from the First Affiliated Hospital of Soochow University, yielded significant variables following selection by the Boruta algorithm and logistic multivariate regression analysis. These significant variables included systolic blood pressure (SBP), fatty liver, waist circumference (WC) and serum creatinine (Scr). The XGBoost model outperformed the other models, demonstrating an AUC of 0.7391 in the validation set.Conclusions The XGBoost model was composed of SBP, fatty liver, WC and Scr may assist doctors with the early identification of IFG in middle-aged and elderly people.
引用
收藏
页数:13
相关论文
共 50 条
  • [11] Online Prediction and Correction of Static Voltage Stability Index Based on Extreme Gradient Boosting Algorithm
    Qin, Huiling
    Li, Shuang
    Zhang, Juncheng
    Rao, Zhi
    He, Chengyu
    Chen, Zhijun
    Li, Bo
    ENERGIES, 2024, 17 (22)
  • [12] Prevalence and risk factors of diabetes and impaired fasting glucose in Nauru
    Amina Khambalia
    Philayrath Phongsavan
    Ben J Smith
    Kieren Keke
    Li Dan
    Andrew Fitzhardinge
    Adrian E Bauman
    BMC Public Health, 11
  • [13] Prevalence and risk factors of diabetes and impaired fasting glucose in Nauru
    Khambalia, Amina
    Phongsavan, Philayrath
    Smith, Ben J.
    Keke, Kieren
    Dan, Li
    Fitzhardinge, Andrew
    Bauman, Adrian E.
    BMC PUBLIC HEALTH, 2011, 11
  • [14] RISK FACTORS FOR IMPAIRED FASTING GLUCOSE IN GREEK NONOBESE ADOLESCENTS
    Voulgari, C.
    Pagoni, S.
    ATHEROSCLEROSIS, 2019, 287 : E133 - E133
  • [15] Differences between impaired fasting glucose and impaired glucose tolerance:: Associated risk factors in the population
    de Pablos, P
    Rodríguez, F
    Martínez, J
    Sánchez, V
    Santana, C
    García, I
    Macías, A
    DIABETOLOGIA, 1999, 42 : A44 - A44
  • [16] Development of an Extreme Gradient Boosting Model Integrated With Evolutionary Algorithms for Hourly Water Level Prediction
    Nguyen, Duc Hai
    Hien Le, Xuan
    Heo, Jae-Yeong
    Bae, Deg-Hyo
    IEEE ACCESS, 2021, 9 : 125853 - 125867
  • [17] Related Factors for Impaired Fasting Glucose in Korean Adults: A Population Based Study
    Hyunjin Lee
    Bohyun Kim
    Youngshin Song
    BMC Public Health, 21
  • [18] Related Factors for Impaired Fasting Glucose in Korean Adults: A Population Based Study
    Lee, Hyunjin
    Kim, Bohyun
    Song, Youngshin
    BMC PUBLIC HEALTH, 2021, 21 (01)
  • [19] Cryptocurrency Price Prediction Using Enhanced PSO with Extreme Gradient Boosting Algorithm
    Srivastava, Vibha
    Dwivedi, Vijay Kumar
    Singh, Ashutosh Kumar
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2023, 23 (02) : 170 - 187
  • [20] Prediction of pullout interaction coefficient of geogrids by extreme gradient boosting model
    Pant, Aali
    Ramana, G. V.
    GEOTEXTILES AND GEOMEMBRANES, 2022, 50 (06) : 1188 - 1198