Estimation of Diabetes in a High -Risk Adult Chinese Population Using J48 Decision Tree Model

被引:5
|
作者
Pei, Dongmei [1 ]
Yang, Tengfei [1 ]
Zhang, Chengpu [1 ]
机构
[1] China Med Univ, Dept Hlth Management, Shengjing Hosp, 36 Sanhao St, Shenyang 110004, Peoples R China
关键词
diabetes; J48; algorithm; decision tree; risk factors; LOGISTIC-REGRESSION; PREDICTIVE MODELS; FOLLOW-UP; TYPE-2; PREVENTION;
D O I
10.2147/DMSO.S279329
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: To predict and make an early diagnosis of diabetes is a critical approach in a population with high risk of diabetes, one of the devastating diseases globally. Traditional and conventional blood tests are recommended for screening the suspected patients; how ever, applying these tests could have health side effects and expensive cost. The goal of this study was to establish a simple and reliable predictive model based on the risk factors associated with diabetes using a decision tree algorithm. Methods: A retrospective cross-sectional study was used in this study. A total of 10,436 participants who had a health check-up from January 2017 to July 2017 were recruited. With appropriate data mining approaches, 3454 participants remained in the final dataset for further analysis. Seventy percent of these participants (2420 cases) were then randomly allocated to either the training dataset for the construction of the decision tree or the testing dataset (30%, 1034 cases) for evaluation of the performance of the decision tree. For this purpose, the cost-sensitive J48 algorithm was used to develop the decision tree model. Results: Utilizing all the key features of the dataset consisting of 14 input variables and two output variables, the constructed decision tree model identified several key factors that are closely linked to the development of diabetes and are also modifiable. Furthermore, our model achieved an accuracy of classification of 90.3% with a precision of 89.7% and a recall of 90.3%. Conclusion: By applying simple and cost-effective classification rules, our decision tree model estimates the development of diabetes in a high-risk adult Chinese population with strong potential for implementation of diabetes management.
引用
收藏
页码:4621 / 4630
页数:10
相关论文
共 50 条
  • [41] Prevalence and risk factors for type 2 diabetes mellitus in the Chinese adult population: The InterASIA Study
    Hu, Dongsheng
    Sun, Liang
    Fu, Pengyu
    Xie, Jing
    Lu, Jie
    Zhou, Jing
    Yu, Dahai
    Whelton, Paul K.
    He, Jiang
    Gu, Dongfeng
    DIABETES RESEARCH AND CLINICAL PRACTICE, 2009, 84 (03) : 288 - 295
  • [42] Risk factors associated with the dramatic increase in the prevalence of diabetes in the adult Chinese population in Qingdao, China
    Ning, F.
    Pang, Z. C.
    Dong, Y. H.
    Gao, W. G.
    Nan, H. R.
    Wang, S. J.
    Zhang, L.
    Ren, J.
    Tuomilehto, J.
    Hammar, N.
    Malmberg, K.
    Andersson, S. W.
    Qiao, Q.
    DIABETIC MEDICINE, 2009, 26 (09) : 855 - 863
  • [43] The Risk Factors of Laryngeal Pathology in Korean Adults Using a Decision Tree Model
    Byeon, Haewon
    JOURNAL OF VOICE, 2015, 29 (01) : 59 - 64
  • [44] An assessment of the risk factors for vitamin D deficiency using a decision tree model
    Gonoodi, Kayhan
    Tayefi, Maryam
    Saberi-Karimian, Maryam
    Zadeh, Alireza Amirabadi
    Darroudi, Susan
    Farahmand, Seyed Kazem
    Abasalti, Zahra
    Moslem, Alireza
    Nematy, Mohsen
    Ferns, Gordon A.
    Eslami, Saeid
    Mobarhan, Majid Ghayour
    DIABETES & METABOLIC SYNDROME-CLINICAL RESEARCH & REVIEWS, 2019, 13 (03) : 1773 - 1777
  • [45] Fuzzy decision tree approach for embedding risk assessment information into software cost estimation model
    Huang, SJ
    Lin, CY
    Chiu, NH
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2006, 22 (02) : 297 - 313
  • [46] Solar energy potential assessment of western Himalayan Indian state of Himachal Pradesh using J48 algorithm of WEKA in ANN based prediction model
    Yadav, Amit Kumar
    Chandel, S. S.
    RENEWABLE ENERGY, 2015, 75 : 675 - 693
  • [47] Hyperuricemia Accompanied with Changes in the Retinal Microcirculation in a Chinese High-risk Population for Diabetes
    M. Kamran IKRAM
    Biomedical and Environmental Sciences, 2011, 24 (02) : 146 - 154
  • [48] Hyperuricemia Accompanied with Changes in the Retinal Microcirculation in a Chinese High-risk Population for Diabetes
    Yuan YuanZhi
    Ikram, M. Kamran
    Jiang SunFang
    Lin HuanDong
    Ren LiMin
    Yan HongMei
    Sheng JianHua
    Chen XuSheng
    Gao Xin
    BIOMEDICAL AND ENVIRONMENTAL SCIENCES, 2011, 24 (02) : 146 - 154
  • [49] Case study on high dimensional data analysis using decision tree model
    Smitha, T., 1600, International Journal of Computer Science Issues (IJCSI) (09): : 3 - 3
  • [50] Intrusion Detection using Decision Tree Model in High-Speed Environment
    Rathore, M. Mazhar
    Saeed, Faisal
    Rehman, Abdul
    Paul, Anand
    Daniel, Alfred
    IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORK SECURITY (ICSNS 2018), 2018, : 301 - 305