Machine Learning Models Integrating Dietary Indicators Improve the Prediction of Progression from Prediabetes to Type 2 Diabetes Mellitus

Authors
Li, Zhuoyang [1]
Li, Yuqian [2]
Mao, Zhenxing [1]
Wang, Chongjian [1]
Hou, Jian [1]
Zhao, Jiaoyan [1]
Wang, Jianwei [1]
Tian, Yuan [1]
Li, Linlin [1]
Affiliations
[1] Zhengzhou Univ, Coll Publ Hlth, Dept Epidemiol & Hlth Stat, Zhengzhou 450001, Peoples R China
[2] Zhengzhou Univ, Sch Pharmaceut Sci, Dept Clin Pharmacol, Zhengzhou 450001, Peoples R China
Keywords
type 2 diabetes mellitus; prediabetes; diet; machine learning; prediction model; cardiovascular disease; risk
DOI
10.3390/nu17060947
Chinese Library Classification (CLC) number
R15 [Nutritional hygiene, food hygiene]; TS201 [Basic science]
Discipline classification code
100403
Abstract
Background: Diet plays an important role in preventing and managing the progression from prediabetes to type 2 diabetes mellitus (T2DM). This study aims to develop prediction models incorporating specific dietary indicators and to explore their performance in T2DM patients and non-T2DM patients. Methods: This retrospective study was conducted on 2215 patients from the Henan Rural Cohort. The key variables were selected using univariate analysis and the least absolute shrinkage and selection operator (LASSO). Multiple predictive models were constructed separately based on dietary and clinical factors. The performance of the different models was compared, and the impact of integrating dietary factors on prediction accuracy was evaluated. Receiver operating characteristic (ROC) curves, calibration curves, and decision curve analysis (DCA) were used to evaluate predictive performance. In addition, group and spatial validation sets were used to further assess the models. SHapley Additive exPlanations (SHAP) analysis was applied to identify key factors influencing the progression to T2DM. Results: Nine dietary indicators were quantitatively collected through standardized questionnaires to construct the dietary models. The extreme gradient boosting (XGBoost) model outperformed the other three models in T2DM prediction. The area under the curve (AUC) and F1 score of the dietary model in the validation cohort were 0.929 [95% confidence interval (CI) 0.916-0.942] and 0.865 (95% CI 0.845-0.884), respectively. Both were higher than those of the traditional model (AUC 0.854 and F1 score 0.779; p < 0.001). SHAP analysis showed that fasting plasma glucose, eggs, whole grains, income level, red meat, nuts, high-density lipoprotein cholesterol, and age were key predictors of progression. Additionally, the calibration curves showed favorable agreement between the dietary model's predictions and actual observations. DCA indicated that using the XGBoost model to predict the risk of T2DM occurrence would be advantageous when the threshold probability exceeded 9%. Conclusions: The XGBoost model built with dietary indicators showed good performance in predicting T2DM. Emphasizing the role of diet is crucial in personalized patient care and management.
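The decision curve analysis mentioned in the abstract compares the net benefit of acting on a model's predictions against default strategies such as treating everyone. The following is a minimal sketch of the standard net-benefit calculation only, not the authors' code: the function names are hypothetical, and the labels and predicted risks are synthetic values made up for illustration, not data from the Henan Rural Cohort.

```python
from typing import Sequence


def net_benefit(y_true: Sequence[int], y_prob: Sequence[float], threshold: float) -> float:
    """Net benefit of intervening on everyone whose predicted risk is >= threshold."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(y_true)
    # True positives and false positives among those flagged at this threshold.
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    # Odds of the threshold: how heavily one false positive counts against one true positive.
    weight = threshold / (1.0 - threshold)
    return tp / n - (fp / n) * weight


def net_benefit_treat_all(y_true: Sequence[int], threshold: float) -> float:
    """Net benefit of the default strategy of intervening on every patient."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


# Synthetic example (assumption): 1 = progressed to T2DM, 0 = did not progress.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_prob = [0.90, 0.70, 0.20, 0.40, 0.15, 0.10, 0.08, 0.05, 0.05, 0.02]

for pt in (0.09, 0.30):
    model_nb = net_benefit(y_true, y_prob, pt)
    all_nb = net_benefit_treat_all(y_true, pt)
    print(f"threshold={pt:.2f}  model={model_nb:.4f}  treat-all={all_nb:.4f}")
```

A model is considered useful at a given threshold when its net benefit exceeds both the treat-all value and zero (the treat-none strategy); a decision curve plots these quantities across a range of thresholds.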
Pages: 15