Estimation of Diabetes in a High-Risk Adult Chinese Population Using J48 Decision Tree Model

Cited by: 5
Authors
Pei, Dongmei [1]
Yang, Tengfei [1]
Zhang, Chengpu [1]
Affiliations
[1] China Med Univ, Dept Hlth Management, Shengjing Hosp, 36 Sanhao St, Shenyang 110004, Peoples R China
Keywords
diabetes; J48; algorithm; decision tree; risk factors; LOGISTIC-REGRESSION; PREDICTIVE MODELS; FOLLOW-UP; TYPE-2; PREVENTION
DOI
10.2147/DMSO.S279329
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Background: Prediction and early diagnosis of diabetes, one of the most devastating diseases globally, are critical in populations at high risk. Conventional blood tests are recommended for screening suspected patients; however, these tests can carry health side effects and high costs. The goal of this study was to establish a simple and reliable predictive model, based on the risk factors associated with diabetes, using a decision tree algorithm. Methods: This was a retrospective cross-sectional study. A total of 10,436 participants who had a health check-up from January 2017 to July 2017 were recruited. After applying appropriate data mining approaches, 3454 participants remained in the final dataset for further analysis. These participants were then randomly allocated to either the training dataset (70%, 2420 cases), used to construct the decision tree, or the testing dataset (30%, 1034 cases), used to evaluate its performance. The cost-sensitive J48 algorithm was used to develop the decision tree model. Results: Using all the key features of the dataset, which consisted of 14 input variables and two output variables, the constructed decision tree model identified several key factors that are closely linked to the development of diabetes and are also modifiable. Furthermore, our model achieved a classification accuracy of 90.3%, with a precision of 89.7% and a recall of 90.3%. Conclusion: By applying simple and cost-effective classification rules, our decision tree model estimates the development of diabetes in a high-risk adult Chinese population and has strong potential for use in diabetes management.
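The evaluation setup described in the abstract (a random 70/30 split, then accuracy, precision, and recall) can be illustrated with a minimal sketch. This is not the authors' code and does not implement J48 (Weka's implementation of the C4.5 decision tree algorithm) or its cost-sensitive variant. The function names `split_indices` and `weighted_metrics` and the confusion matrix are hypothetical, chosen only for illustration. The sketch also assumes the reported precision and recall are support-weighted averages over classes, which the abstract does not state; under that assumption weighted recall always equals accuracy, which would be consistent with both being reported as 90.3%.

```python
import random


def split_indices(n, train_fraction=0.7, seed=0):
    """Shuffle indices 0..n-1 and split them into train and test lists."""
    rng = random.Random(seed)
    indices = list(range(n))
    rng.shuffle(indices)
    n_train = round(n * train_fraction)
    return indices[:n_train], indices[n_train:]


def weighted_metrics(cm):
    """Return (accuracy, weighted precision, weighted recall).

    cm[i][j] is the number of cases of true class i predicted as class j.
    Per-class scores are weighted by each class's share of the true labels.
    """
    k = len(cm)
    total = sum(sum(row) for row in cm)
    accuracy = sum(cm[i][i] for i in range(k)) / total
    precision = 0.0
    recall = 0.0
    for i in range(k):
        support = sum(cm[i])                      # true cases of class i
        predicted = sum(cm[r][i] for r in range(k))  # cases predicted as i
        weight = support / total
        if predicted:
            precision += weight * cm[i][i] / predicted
        if support:
            recall += weight * cm[i][i] / support
    return accuracy, precision, recall


# 3454 is the final dataset size from the abstract. A straight 70% rounding
# gives slightly different counts than the 2420/1034 reported there; this
# sketch does not try to reproduce the exact allocation.
train_idx, test_idx = split_indices(3454)

# Hypothetical confusion matrix (made-up numbers, not study results):
# rows are true classes, columns are predicted classes.
example_cm = [[80, 5],
              [10, 5]]
accuracy, precision, recall = weighted_metrics(example_cm)
print(len(train_idx), len(test_idx))
print(accuracy, precision, recall)
```

With the made-up matrix above, weighted recall and accuracy coincide while weighted precision differs, which mirrors the pattern in the reported figures without confirming how they were computed.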
Pages: 4621-4630
Number of pages: 10
Related papers
50 in total
  • [21] Application of an ensemble learning model based on random subspace and a J48 decision tree for landslide susceptibility mapping: a case study for Qingchuan, Sichuan, China
    Yangchun Li
    Feikai Lin
    Xiangang Luo
    Shuang Zhu
    Jiang Li
    Zhanya Xu
    Xiuwei Liu
    Shungen Luo
    Guangjie Huo
    Liangsheng Peng
    Haiping Feng
    Environmental Earth Sciences, 2022, 81
  • [22] Application of an ensemble learning model based on random subspace and a J48 decision tree for landslide susceptibility mapping: a case study for Qingchuan, Sichuan, China
    Li, Yangchun
    Lin, Feikai
    Luo, Xiangang
    Zhu, Shuang
    Li, Jiang
    Xu, Zhanya
    Liu, Xiuwei
    Luo, Shungen
    Huo, Guangjie
    Peng, Liangsheng
    Feng, Haiping
    ENVIRONMENTAL EARTH SCIENCES, 2022, 81 (09)
  • [23] Risk assessment of elevated blood lead concentrations in the adult population using a decision tree approach
    Amirabadizadeh, Alireza
    Nakhaee, Samaneh
    Mehrpour, Omid
    DRUG AND CHEMICAL TOXICOLOGY, 2022, 45 (02) : 878 - 885
  • [24] Developing a Prediction Model Using J48 Algorithm to Predict Symptoms of COVID-19 Causing Death
    Al Sadig, Mutasim
    Sattar, Khalid Nazim Abdul
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (08) : 80 - 83
  • [25] Ensemble human movement sequence prediction model with Apriori based Probability Tree Classifier (APTC) and Bagged J48 on Machine learning
    Raj, Sridhar S.
    Nandhini, M.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (04) : 408 - 416
  • [26] Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study
    Pham B.T.
    Tien Bui D.
    Prakash I.
    Geotechnical and Geological Engineering, 2017, 35 (6) : 2597 - 2611
  • [27] Identification of Potential Type II Diabetes in a Chinese Population with a Sensitive Decision Tree Approach
    Pei, Dongmei
    Zhang, Chengpu
    Quan, Yu
    Guo, Qiyong
    JOURNAL OF DIABETES RESEARCH, 2019, 2019
  • [28] Building Predictive Model of Covid 19 Quarantine Impact on the Purchase of Environmentally Green Products by Using J48 and LMT Algorithms
    Al Sadig, Mutasim
    Babikir, Nahid Osman Ali
    Ali, Faisal Mohammed Nafie
    Sattar, Khalid Nazim Abdul
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (01) : 517 - 522
  • [29] Markers of Liver Dysfunction and Risk of Type 2 Diabetes in Chinese Adult Population
    Yang, Zhaojun
    Xiao, Jianzhong
    Bu, Shi
    Ruan, Danjie
    Li, Yufeng
    Wang, Na
    Yang, Wenying
    DIABETES, 2009, 58 : A580 - A580
  • [30] High percentage of undiagnosed diabetes and at-risk individuals in a Chinese population
    Li, M
    Hu, GZ
    Chimera, J
    DIABETES, 2001, 50 : A479 - A480