Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches

Cited by: 76
Authors
Joshi, Ram D. [1]
Dhakal, Chandra K. [2]
Affiliations
[1] Texas Tech Univ, Dept Econ, Lubbock, TX 79409 USA
[2] Univ Georgia, Dept Agr & Appl Econ, Athens, GA 30602 USA
Keywords
decision tree; diabetes risk factors; machine learning; prediction accuracy; INSULIN-RESISTANCE; RISK-FACTORS; LIFE-STYLE; MELLITUS; RECOMMENDATIONS; POPULATION; DISEASES; OBESITY; TOOL
DOI
10.3390/ijerph18147346
Chinese Library Classification number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Diabetes mellitus is one of the most common human diseases worldwide and may cause several health-related complications. It is responsible for considerable morbidity, mortality, and economic loss. Timely diagnosis and prediction of this disease could give patients an opportunity to adopt appropriate preventive and treatment strategies. To improve the understanding of risk factors, we predict type 2 diabetes for Pima Indian women using a logistic regression model and a decision tree, a machine learning algorithm. Our analysis finds five main predictors of type 2 diabetes: glucose, pregnancy, body mass index (BMI), diabetes pedigree function, and age. We further explore a classification tree to complement and validate our analysis. The six-fold classification tree indicates that glucose, BMI, and age are important factors, while the ten-node tree identifies glucose, BMI, pregnancy, diabetes pedigree function, and age as the significant predictors. Our preferred specification yields a prediction accuracy of 78.26% and a cross-validation error rate of 21.74%. We argue that our model can be applied to make a reasonable prediction of type 2 diabetes, and could potentially be used to complement existing preventive measures to curb the incidence of diabetes and reduce associated costs.
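The abstract reports a prediction accuracy and a cross-validation error rate that sum to 100% (78.26% + 21.74%). The following is a minimal sketch of how such a pair of figures can arise from k-fold cross-validation of a logistic regression classifier. It is not the authors' code: the data are synthetic (not the Pima Indian dataset), the two features are stand-ins loosely named after glucose and BMI, the fit uses plain gradient descent, and k = 5 is an arbitrary choice. All function names are hypothetical.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(X, y, lr=0.1, epochs=500):
    # Batch gradient descent on the mean log-loss (an illustrative fitting
    # method, assumed here; the abstract does not state how the model was fit).
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(w, b, xi):
    # Classify as positive when the predicted probability is at least 0.5.
    p = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
    return 1 if p >= 0.5 else 0


def cv_error(X, y, k=5, seed=0):
    # k-fold cross-validation: each observation is held out exactly once,
    # and the error rate is the share of held-out points misclassified.
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    wrong = 0
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        w, b = fit_logistic([X[i] for i in train], [y[i] for i in train])
        wrong += sum(predict(w, b, X[i]) != y[i] for i in fold)
    return wrong / len(X)


def make_synthetic(n=200, seed=1):
    # Synthetic, standardized stand-ins for two predictors (NOT real data).
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        glucose, bmi = rng.gauss(0, 1), rng.gauss(0, 1)
        p = sigmoid(1.5 * glucose + 0.8 * bmi - 0.5)
        X.append([glucose, bmi])
        y.append(1 if rng.random() < p else 0)
    return X, y


X, y = make_synthetic()
error = cv_error(X, y)
accuracy = 1.0 - error  # accuracy and error rate are complements
print(f"CV error rate: {error:.2%}, accuracy: {accuracy:.2%}")
```

Because every observation is scored exactly once while held out, the accuracy is simply one minus the cross-validation error rate, which is why the two percentages in the abstract are complementary.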
Pages: 17