Machine Learning-Based Approach for Predicting Diabetes Employing Socio-Demographic Characteristics

被引:1
|
作者
Rahman, Md. Ashikur [1 ]
Abdulrazak, Lway Faisal [2 ]
Ali, Md. Mamun [1 ,3 ,4 ]
Mahmud, Imran [1 ]
Ahmed, Kawsar [4 ,5 ,6 ]
Bui, Francis M. [3 ,5 ]
机构
[1] Daffodil Int Univ, Dept Software Engn, Daffodil Smart City DSC, Savar 1216, Bangladesh
[2] Cihan Univ Sulaimaniya, Dept Comp Sci, Sulaimaniya 46001, Kurdistan, Iraq
[3] Univ Saskatchewan, Div Biomed Engn, 57 Campus Dr, Saskatoon, SK S7N 5A9, Canada
[4] Daffodil Int Univ, Dept Comp Sci & Engn, Hlth Informat Res Lab, Savar 1216, Bangladesh
[5] Univ Saskatchewan, Dept Elect & Comp Engn, 57 Campus Dr, Saskatoon, SK S7N 5A9, Canada
[6] Mawlana Bhashani Sci & Technol Univ, Dept Informat & Commun Technol, Grp Biophotomatiχ, Tangail 1902, Bangladesh
基金
加拿大自然科学与工程研究理事会;
关键词
diabetes; socio-demographic characteristics; machine learning; polydipsia; sudden weight loss; DIAGNOSIS;
D O I
10.3390/a16110503
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diabetes is one of the fatal diseases that play a vital role in the growth of other diseases in the human body. From a clinical perspective, the most significant approach to mitigating the effects of diabetes is early-stage control and management, with the aim of a potential cure. However, lack of awareness and expensive clinical tests are the primary reasons why clinical diagnosis and preventive measures are neglected in lower-income countries like Bangladesh, Pakistan, and India. From this perspective, this study aims to build an automated machine learning (ML) model, which will predict diabetes at an early stage using socio-demographic characteristics rather than clinical attributes, due to the fact that clinical features are not always accessible to all people from lower-income countries. To find the best fit of the supervised ML classifier of the model, we applied six classification algorithms and found that RF outperformed with an accuracy of 99.36%. In addition, the most significant risk factors were found based on the SHAP value by all the applied classifiers. This study reveals that polyuria, polydipsia, and delayed healing are the most significant risk factors for developing diabetes. The findings indicate that the proposed model is highly capable of predicting diabetes in the early stages.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] THE INFLUENCE OF SOCIO-DEMOGRAPHIC CHARACTERISTICS ON ENVIRONMENTAL AWARENESS
    Lysenko-Ryba, Kateryna
    Zimon, Dominik
    Zatwarnicka-Madura, Beata
    INTERNATIONAL JOURNAL FOR QUALITY RESEARCH, 2024, 18 (04) : 1259 - 1268
  • [32] Socio-demographic characteristics as correlates of psychological distress
    Okoro, Johnson Nwabueze
    Ezeonwuka, Chinenye Nnenna
    Onu, Justus Uchenna
    INTERNATIONAL JOURNAL OF PRISONER HEALTH, 2018, 14 (03) : 210 - 219
  • [33] Author Correction: Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
    Sandra C. Matz
    Christina S. Bukow
    Heinrich Peters
    Christine Deacons
    Alice Dinu
    Clemens Stachl
    Scientific Reports, 13
  • [34] A machine learning-based approach to predicting the malignant and metastasis of thyroid cancer
    Gu, Jianhua
    Xie, Rongli
    Zhao, Yanna
    Zhao, Zhifeng
    Xu, Dan
    Ding, Min
    Lin, Tingyu
    Xu, Wenjuan
    Nie, Zihuai
    Miao, Enjun
    Tan, Dan
    Zhu, Sibo
    Shen, Dongjie
    Fei, Jian
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [35] THPep: A machine learning-based approach for predicting tumor homing peptides
    Shoombuatong, Watshara
    Schaduangrat, Nalini
    Pratiwi, Reny
    Nantasenamat, Chanin
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 80 : 441 - 451
  • [36] Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection
    K. M. Aslam Uddin
    Farida Siddiqi Prity
    Maisha Tasnim
    Sumiya Nur Jannat
    Mohammad Omar Faruk
    Jahirul Islam
    Saydul Akbar Murad
    Apurba Adhikary
    Anupam Kumar Bairagi
    Human-Centric Intelligent Systems, 2023, 3 (4): : 441 - 460
  • [37] Socio-demographic risk factors of Gestational Diabetes Mellitus
    Khan, Radhia
    Ali, Khurshid
    Khan, Zakkia
    PAKISTAN JOURNAL OF MEDICAL SCIENCES, 2013, 29 (03) : 843 - 846
  • [38] CHOICE OF INJECTABLE THERAPY IN THE TYPE 2 DIABETES TRAJECTORY: SOCIO-DEMOGRAPHIC AND CLINICAL CHARACTERISTICS
    Reges, O.
    Feldman, B.
    Gofer, I
    Karpati, T.
    Leibowitz, M.
    Balicer, R. D.
    Curtis, B. H.
    He, X.
    Rubin, G.
    Strizek, A. A.
    Leventer-Roberts, M.
    VALUE IN HEALTH, 2017, 20 (05) : A176 - A176
  • [39] HUNTERS IN CROATIA AS A SOCIO-GEOGRAPHIC GROUP AND THEIR SOCIO-DEMOGRAPHIC CHARACTERISTICS
    Pejnovic, Dane
    Krapinec, Kresimir
    Slamar, Maja
    SUMARSKI LIST, 2010, 134 (9-10): : 461 - 474
  • [40] Socio-demographic Characteristics and the Three Delays of Maternal Mortality
    Shah, Nusrat
    Hossain, Nazli
    Shoaib, Rizwana
    Hussain, Ayesha
    Gillani, Rehma
    Khan, Nusrat H.
    JCPSP-JOURNAL OF THE COLLEGE OF PHYSICIANS AND SURGEONS PAKISTAN, 2009, 19 (02): : 95 - 98