Estimation of Diabetes in a High -Risk Adult Chinese Population Using J48 Decision Tree Model

被引:5
|
作者
Pei, Dongmei [1 ]
Yang, Tengfei [1 ]
Zhang, Chengpu [1 ]
机构
[1] China Med Univ, Dept Hlth Management, Shengjing Hosp, 36 Sanhao St, Shenyang 110004, Peoples R China
关键词
diabetes; J48; algorithm; decision tree; risk factors; LOGISTIC-REGRESSION; PREDICTIVE MODELS; FOLLOW-UP; TYPE-2; PREVENTION;
D O I
10.2147/DMSO.S279329
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: To predict and make an early diagnosis of diabetes is a critical approach in a population with high risk of diabetes, one of the devastating diseases globally. Traditional and conventional blood tests are recommended for screening the suspected patients; how ever, applying these tests could have health side effects and expensive cost. The goal of this study was to establish a simple and reliable predictive model based on the risk factors associated with diabetes using a decision tree algorithm. Methods: A retrospective cross-sectional study was used in this study. A total of 10,436 participants who had a health check-up from January 2017 to July 2017 were recruited. With appropriate data mining approaches, 3454 participants remained in the final dataset for further analysis. Seventy percent of these participants (2420 cases) were then randomly allocated to either the training dataset for the construction of the decision tree or the testing dataset (30%, 1034 cases) for evaluation of the performance of the decision tree. For this purpose, the cost-sensitive J48 algorithm was used to develop the decision tree model. Results: Utilizing all the key features of the dataset consisting of 14 input variables and two output variables, the constructed decision tree model identified several key factors that are closely linked to the development of diabetes and are also modifiable. Furthermore, our model achieved an accuracy of classification of 90.3% with a precision of 89.7% and a recall of 90.3%. Conclusion: By applying simple and cost-effective classification rules, our decision tree model estimates the development of diabetes in a high-risk adult Chinese population with strong potential for implementation of diabetes management.
引用
收藏
页码:4621 / 4630
页数:10
相关论文
共 50 条
  • [31] Using a Decision Tree Algorithm Predictive Model for Sperm Count Assessment and Risk Factors in Health Screening Population
    Huang, Hung-Hsiang
    Lu, Chi-Jie
    Jhou, Mao-Jhen
    Liu, Tzu-Chi
    Yang, Chih-Te
    Hsieh, Shang-Ju
    Yang, Wen-Jen
    Chang, Hsiao-Chun
    Chen, Ming-Shu
    RISK MANAGEMENT AND HEALTHCARE POLICY, 2023, 16 : 2469 - 2478
  • [32] Development and Validation of a Risk-Score Model for Type 2 Diabetes: A Cohort Study of a Rural Adult Chinese Population
    Zhang, Ming
    Zhang, Hongyan
    Wang, Chongjian
    Ren, Yongcheng
    Wang, Bingyuan
    Zhang, Lu
    Yang, Xiangyu
    Zhao, Yang
    Han, Chengyi
    Pang, Chao
    Yin, Lei
    Xue, Yuan
    Zhao, Jingzhi
    Hu, Dongsheng
    PLOS ONE, 2016, 11 (04):
  • [33] Inverse association between adult height and diabetes risk in a cohort study of Chinese population
    Xiaoli Li
    Tiantian Cheng
    Lina Leng
    Guangyao Song
    Huijuan Ma
    Scientific Reports, 13
  • [34] Inverse association between adult height and diabetes risk in a cohort study of Chinese population
    Li, Xiaoli
    Cheng, Tiantian
    Leng, Lina
    Song, Guangyao
    Ma, Huijuan
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [35] Reanalysis and External Validation of a Decision Tree Model for Detecting Unrecognized Diabetes in Rural Chinese Individuals
    Xin, Zhong
    Hua, Lin
    Wang, Xu-Hong
    Zhao, Dong
    Yu, Cai-Guo
    Ma, Ya-Hong
    Zhao, Lei
    Cao, Xi
    Yang, Jin-Kui
    INTERNATIONAL JOURNAL OF ENDOCRINOLOGY, 2017, 2017
  • [37] INTELLIGENT COOPERATIVE WEB CACHING POLICIES FOR MEDIA OBJECTS BASED ON J48 DECISION TREE AND NAIVE BAYES SUPERVISED MACHINE LEARNING ALGORITHMS IN STRUCTURED PEER-TO-PEER SYSTEMS
    Ibrahim, Hamidah
    Yasin, Waheed
    Udzir, Nur Izura
    Hamid, Nor Asilah Wati Abdul
    JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2016, 15 (02): : 85 - 116
  • [38] Estimation of the adult population at high risk of developing lung cancer in the European Union
    Gonzalez-Marron, Adrian
    Carlos Martin-Sanchez, Juan
    Matilla-Santander, Nuria
    Cartanya-Hueso, Aurea
    Lidon-Moyano, Cristina
    Vidal, Carmen
    Garcia, Montse
    Martinez-Sanchez, Jose M.
    CANCER EPIDEMIOLOGY, 2018, 57 : 140 - 147
  • [39] Risk Prediction Model of Gestational Diabetes Mellitus in a Chinese Population Based on a Risk Scoring System
    Yanmei Wang
    Zhijuan Ge
    Lei Chen
    Jun Hu
    Wenting Zhou
    Shanmei Shen
    Dalong Zhu
    Yan Bi
    Diabetes Therapy, 2021, 12 : 1721 - 1734
  • [40] Risk Prediction Model of Gestational Diabetes Mellitus in a Chinese Population Based on a Risk Scoring System
    Wang, Yanmei
    Ge, Zhijuan
    Chen, Lei
    Hu, Jun
    Zhou, Wenting
    Shen, Shanmei
    Zhu, Dalong
    Bi, Yan
    DIABETES THERAPY, 2021, 12 (06) : 1721 - 1734