Machine learning for characterizing risk of type 2 diabetes mellitus in a rural Chinese population: the Henan Rural Cohort Study

被引:0
|
作者
Liying Zhang
Yikang Wang
Miaomiao Niu
Chongjian Wang
Zhenfei Wang
机构
[1] Zhengzhou University,School of Information Engineering
[2] Zhengzhou University,Department of Epidemiology and Biostatistics, College of Public Health
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
With the development of data mining, machine learning offers opportunities to improve discrimination by analyzing complex interactions among massive variables. To test the ability of machine learning algorithms for predicting risk of type 2 diabetes mellitus (T2DM) in a rural Chinese population, we focus on a total of 36,652 eligible participants from the Henan Rural Cohort Study. Risk assessment models for T2DM were developed using six machine learning algorithms, including logistic regression (LR), classification and regression tree (CART), artificial neural networks (ANN), support vector machine (SVM), random forest (RF) and gradient boosting machine (GBM). The model performance was measured in an area under the receiver operating characteristic curve, sensitivity, specificity, positive predictive value, negative predictive value and area under precision recall curve. The importance of variables was identified based on each classifier and the shapley additive explanations approach. Using all available variables, all models for predicting risk of T2DM demonstrated strong predictive performance, with AUCs ranging between 0.811 and 0.872 using laboratory data and from 0.767 to 0.817 without laboratory data. Among them, the GBM model performed best (AUC: 0.872 with laboratory data and 0.817 without laboratory data). Performance of models plateaued when introduced 30 variables to each model except CART model. Among the top-10 variables across all methods were sweet flavor, urine glucose, age, heart rate, creatinine, waist circumference, uric acid, pulse pressure, insulin, and hypertension. New important risk factors (urinary indicators, sweet flavor) were not found in previous risk prediction methods, but determined by machine learning in our study. Through the results, machine learning methods showed competence in predicting risk of T2DM, leading to greater insights on disease risk factors with no priori assumption of causality.
引用
收藏
相关论文
共 50 条
  • [41] Association of Residential Greenness with the Prevalence of Metabolic Syndrome in a Rural Chinese Population: the Henan Rural Cohort Study
    He Ya Ling
    Liu Xiao Tian
    Tu Run Qi
    Pan Ming Ming
    Niu Miao Miao
    Chen Gong Bo
    Hou Jian
    Mao Zhen Xing
    Huo Wen Qian
    Li Shan Shan
    Guo Yu Ming
    Wang Chong Jian
    [J]. BIOMEDICAL AND ENVIRONMENTAL SCIENCES, 2022, 35 (01) : 89 - +
  • [42] Association of Residential Greenness with the Prevalence of Metabolic Syndrome in a Rural Chinese Population:the Henan Rural Cohort Study
    HE Ya Ling
    LIU Xiao Tian
    TU Run Qi
    PAN Ming Ming
    NIU Miao Miao
    CHEN Gong Bo
    HOU Jian
    MAO Zhen Xing
    HUO Wen Qian
    LI Shan Shan
    GUO Yu Ming
    WANG Chong Jian
    [J]. Biomedical and Environmental Sciences, 2022, 35 (01) : 89 - 94
  • [43] Association of intestinal microbiota markers and dietary pattern in Chinese patients with type 2 diabetes: The Henan rural cohort study
    Wang, Guanjun
    Lyu, Quanjun
    Yang, Tianyu
    Cui, Songyang
    Niu, Kailin
    Gu, Ruohua
    Li, Yan
    Li, Jia
    Xing, Wenguo
    Li, Linlin
    [J]. FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [44] Development and Validation of a Risk-Score Model for Type 2 Diabetes: A Cohort Study of a Rural Adult Chinese Population
    Zhang, Ming
    Zhang, Hongyan
    Wang, Chongjian
    Ren, Yongcheng
    Wang, Bingyuan
    Zhang, Lu
    Yang, Xiangyu
    Zhao, Yang
    Han, Chengyi
    Pang, Chao
    Yin, Lei
    Xue, Yuan
    Zhao, Jingzhi
    Hu, Dongsheng
    [J]. PLOS ONE, 2016, 11 (04):
  • [45] The dose-response relationship of fruit and vegetable intake and risk of type 2 diabetes among rural China: The Henan Rural Cohort study
    Niu, Kailin
    Lyu, Quanjun
    Zhang, Shuhua
    Wang, Chongjian
    Mao, Zhenxing
    Cui, Songyang
    Gu, Ruohua
    Li, Linlin
    [J]. PRIMARY CARE DIABETES, 2023, 17 (02) : 161 - 167
  • [46] Hypertriglyceridemia-waist and risk of developing type 2 diabetes: The Rural Chinese Cohort Study
    Yongcheng Ren
    Yu Liu
    Xizhuo Sun
    Kunpeng Deng
    Chongjian Wang
    Linlin Li
    Lu Zhang
    Bingyuan Wang
    Yang Zhao
    Junmei Zhou
    Chengyi Han
    Hongyan Zhang
    Xiangyu Yang
    Xinping Luo
    Chao Pang
    Lei Yin
    Tianping Feng
    Jingzhi Zhao
    Ming Zhang
    Dongsheng Hu
    [J]. Scientific Reports, 7
  • [47] Hypertriglyceridemia-waist and risk of developing type 2 diabetes: The Rural Chinese Cohort Study
    Ren, Yongcheng
    Liu, Yu
    Sun, Xizhuo
    Deng, Kunpeng
    Wang, Chongjian
    Li, Linlin
    Zhang, Lu
    Wang, Bingyuan
    Zhao, Yang
    Zhou, Junmei
    Han, Chengyi
    Zhang, Hongyan
    Yang, Xiangyu
    Luo, Xinping
    Pang, Chao
    Yin, Lei
    Feng, Tianping
    Zhao, Jingzhi
    Zhang, Ming
    Hu, Dongsheng
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [48] SOCS3 methylation mediated the effect of sedentary time on type 2 diabetes mellitus: The Henan Rural Cohort study
    Liu, Xiaotian
    Qian, Xinling
    Tu, Runqi
    Mao, Zhenxing
    Huo, Wenqian
    Zhang, Haiqing
    Jiang, Jingjing
    Zhang, Xia
    Tian, Zhongyan
    Li, Yuqian
    Wang, Chongjian
    [J]. NUTRITION METABOLISM AND CARDIOVASCULAR DISEASES, 2020, 30 (04) : 634 - 643
  • [49] Association of sleep duration with risk of type 2 diabetes mellitus in a rural Chinese population: a nested case-control study
    Cui, Songyang
    Li, Yuqian
    Chen, Yu
    Ren, Pengfei
    Fan, Mengying
    Yang, Xiu
    Wang, Chongjian
    Zhang, Lulu
    Han, Shengna
    Li, Linlin
    [J]. SLEEP AND BREATHING, 2022, 26 (04) : 2025 - 2033
  • [50] The Dynamics of Type 2 Diabetes Mellitus Prevalence and Management Rates among Rural Population in Henan Province, China
    Liu, Xiaotian
    Wang, Ling
    Wang, Panpan
    Liu, Ruihua
    Yang, Kaili
    Qian, Xinling
    Fan, Jingjing
    Yu, Songcheng
    Li, Yuqian
    Wang, Chongjian
    [J]. JOURNAL OF DIABETES RESEARCH, 2017, 2017