Predicting Diabetes Mellitus With Machine Learning Techniques

Cited by: 343
Authors
Zou, Quan [1, 2]
Qu, Kaiyang [1]
Luo, Yamei [3]
Yin, Dehui [3]
Ju, Ying [4]
Tang, Hua [5]
Affiliations
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin, Peoples R China
[2] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Chengdu, Sichuan, Peoples R China
[3] Southwest Med Univ, Sch Med Informat & Engn, Luzhou, Peoples R China
[4] Xiamen Univ, Sch Informat Sci & Technol, Xiamen, Peoples R China
[5] Southwest Med Univ, Sch Basic Med, Dept Pathophysiol, Luzhou, Peoples R China
Keywords
diabetes mellitus; random forest; decision tree; neural network; machine learning; feature ranking; RANDOM FOREST; FEATURE-SELECTION; DIAGNOSIS; CLASSIFICATION; EXTRACTION; TOOL;
DOI
10.3389/fgene.2018.00515
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Diabetes mellitus is a chronic disease characterized by hyperglycemia, and it can cause many complications. Given the growing morbidity of recent years, the number of diabetic patients worldwide is projected to reach 642 million by 2040, which means that one in ten adults will have diabetes. This alarming figure demands attention. With its rapid development, machine learning has been applied to many aspects of medical health. In this study, we used decision tree, random forest, and neural network models to predict diabetes mellitus. The dataset consists of hospital physical examination data from Luzhou, China, and contains 14 attributes. Five-fold cross-validation was used to evaluate the models. To verify the general applicability of the methods, we selected the better-performing ones for independent test experiments. We randomly selected the data of 68,994 healthy people and 68,994 diabetic patients as the training set. Because the data are imbalanced, we repeated the random extraction five times and report the average of the five experiments. We used principal component analysis (PCA) and minimum redundancy maximum relevance (mRMR) to reduce the dimensionality. The results showed that prediction with random forest reached the highest accuracy (ACC = 0.8084) when all the attributes were used.
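The evaluation protocol described in the abstract (repeated balanced sampling, five-fold cross-validation, averaged accuracy) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' code: the data are synthetic, all function names are hypothetical, and a simple nearest-centroid classifier stands in for the decision tree, random forest, and neural network models so that the example needs only the Python standard library.

```python
import random
from statistics import mean

def make_synthetic(n_healthy, n_diabetic, n_features=14, seed=0):
    """Synthetic stand-in for the examination data (assumption): label-1 rows are shifted by +1."""
    rng = random.Random(seed)
    rows = []
    for label, count in ((0, n_healthy), (1, n_diabetic)):
        for _ in range(count):
            rows.append(([rng.gauss(label, 1.0) for _ in range(n_features)], label))
    return rows

def balanced_sample(rows, rng):
    """Undersample the majority class so both classes have the same size."""
    by_label = {0: [], 1: []}
    for row in rows:
        by_label[row[1]].append(row)
    n = min(len(by_label[0]), len(by_label[1]))
    sample = rng.sample(by_label[0], n) + rng.sample(by_label[1], n)
    rng.shuffle(sample)
    return sample

def fit_centroids(train):
    """Placeholder model (not a random forest): one mean vector per class."""
    centroids = {}
    for label in (0, 1):
        vectors = [x for x, y in train if y == label]
        centroids[label] = [mean(col) for col in zip(*vectors)]
    return centroids

def predict(centroids, x):
    """Assign x to the class whose centroid is closest in squared Euclidean distance."""
    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda label: dist(centroids[label]))

def cross_val_accuracy(rows, k=5):
    """Mean accuracy over k folds; fold i holds out every k-th row starting at index i."""
    scores = []
    for i in range(k):
        test = rows[i::k]
        train = [r for j, r in enumerate(rows) if j % k != i]
        model = fit_centroids(train)
        correct = sum(predict(model, x) == y for x, y in test)
        scores.append(correct / len(test))
    return mean(scores)

def repeated_balanced_cv(rows, repeats=5, k=5, seed=0):
    """Average the k-fold accuracy over several independently drawn balanced samples."""
    rng = random.Random(seed)
    return mean(cross_val_accuracy(balanced_sample(rows, rng), k) for _ in range(repeats))

data = make_synthetic(n_healthy=600, n_diabetic=200)
acc = repeated_balanced_cv(data)
print(f"mean accuracy over 5 balanced samples x 5 folds: {acc:.4f}")
```

To move closer to the study's setting, the placeholder model could be swapped for an actual classifier such as scikit-learn's `RandomForestClassifier`, with the sampling and averaging loop left unchanged.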
Pages: 10