Machine Learning-Based Approach for Predicting Diabetes Employing Socio-Demographic Characteristics

被引:1
|
作者
Rahman, Md. Ashikur [1 ]
Abdulrazak, Lway Faisal [2 ]
Ali, Md. Mamun [1 ,3 ,4 ]
Mahmud, Imran [1 ]
Ahmed, Kawsar [4 ,5 ,6 ]
Bui, Francis M. [3 ,5 ]
机构
[1] Daffodil Int Univ, Dept Software Engn, Daffodil Smart City DSC, Savar 1216, Bangladesh
[2] Cihan Univ Sulaimaniya, Dept Comp Sci, Sulaimaniya 46001, Kurdistan, Iraq
[3] Univ Saskatchewan, Div Biomed Engn, 57 Campus Dr, Saskatoon, SK S7N 5A9, Canada
[4] Daffodil Int Univ, Dept Comp Sci & Engn, Hlth Informat Res Lab, Savar 1216, Bangladesh
[5] Univ Saskatchewan, Dept Elect & Comp Engn, 57 Campus Dr, Saskatoon, SK S7N 5A9, Canada
[6] Mawlana Bhashani Sci & Technol Univ, Dept Informat & Commun Technol, Grp Biophotomatiχ, Tangail 1902, Bangladesh
基金
加拿大自然科学与工程研究理事会;
关键词
diabetes; socio-demographic characteristics; machine learning; polydipsia; sudden weight loss; DIAGNOSIS;
D O I
10.3390/a16110503
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diabetes is one of the fatal diseases that play a vital role in the growth of other diseases in the human body. From a clinical perspective, the most significant approach to mitigating the effects of diabetes is early-stage control and management, with the aim of a potential cure. However, lack of awareness and expensive clinical tests are the primary reasons why clinical diagnosis and preventive measures are neglected in lower-income countries like Bangladesh, Pakistan, and India. From this perspective, this study aims to build an automated machine learning (ML) model, which will predict diabetes at an early stage using socio-demographic characteristics rather than clinical attributes, due to the fact that clinical features are not always accessible to all people from lower-income countries. To find the best fit of the supervised ML classifier of the model, we applied six classification algorithms and found that RF outperformed with an accuracy of 99.36%. In addition, the most significant risk factors were found based on the SHAP value by all the applied classifiers. This study reveals that polyuria, polydipsia, and delayed healing are the most significant risk factors for developing diabetes. The findings indicate that the proposed model is highly capable of predicting diabetes in the early stages.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Cytokine gene variants and socio-demographic characteristics as predictors of cervical cancer: A machine learning approach
    Kaushik, Manoj
    Joshi, Rakesh Chandra
    Kushwah, Atar Singh
    Gupta, Maneesh Kumar
    Banerjee, Monisha
    Burget, Radim
    Dutta, Malay Kishore
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 134
  • [2] Predicting students' academic performance based on school and socio-demographic characteristics
    Thiele, Tamara
    Singleton, Alexander
    Pope, Daniel
    Stanistreet, Debbi
    STUDIES IN HIGHER EDUCATION, 2016, 41 (08) : 1424 - 1446
  • [3] Machine learning-based approach for predicting the consolidation characteristics of soft soil
    Singh, Moirangthem Johnson
    Kaushik, Anshul
    Patnaik, Gyanesh
    Xu, Dong-Sheng
    Feng, Wei-Qiang
    Rajput, Abhishek
    Prakash, Guru
    Borana, Lalit
    MARINE GEORESOURCES & GEOTECHNOLOGY, 2024, 42 (04) : 405 - 419
  • [4] The usefulness of socio-demographic variables in predicting purchase decisions: Evidence from machine learning procedures
    Islam, Towhidul
    Meade, Nigel
    Carson, Richard T.
    Louviere, Jordan J.
    Wang, Juan
    JOURNAL OF BUSINESS RESEARCH, 2022, 151 : 324 - 338
  • [5] Deep Learning-Based Socio-Demographic Information Identification From Smart Meter Data
    Wang, Yi
    Chen, Qixin
    Gan, Dahua
    Yang, Jingwei
    Kirschen, Daniel S.
    Kang, Chongqing
    IEEE TRANSACTIONS ON SMART GRID, 2019, 10 (03) : 2593 - 2602
  • [6] Predicting mergers & acquisitions: A machine learning-based approach
    Zhao, Yuchen
    Bi, Xiaogang
    Ma, Qing-Ping
    INTERNATIONAL REVIEW OF FINANCIAL ANALYSIS, 2025, 99
  • [7] Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
    Matz, Sandra C.
    Bukow, Christina S.
    Peters, Heinrich
    Deacons, Christine
    Stachl, Clemens
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [8] A comparative investigation of machine learning algorithms for predicting safety signs comprehension based on socio-demographic factors and cognitive sign features
    Sajjad Rostamzadeh
    Alireza Abouhossein
    Mahnaz Saremi
    Fereshteh Taheri
    Mobin Ebrahimian
    Shahram Vosoughi
    Scientific Reports, 13
  • [9] A comparative investigation of machine learning algorithms for predicting safety signs comprehension based on socio-demographic factors and cognitive sign features
    Rostamzadeh, Sajjad
    Abouhossein, Alireza
    Saremi, Mahnaz
    Taheri, Fereshteh
    Ebrahimian, Mobin
    Vosoughi, Shahram
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [10] Using machine learning to predict student retention from socio-demographic characteristics and app-based engagement metrics
    Sandra C. Matz
    Christina S. Bukow
    Heinrich Peters
    Christine Deacons
    Alice Dinu
    Clemens Stachl
    Scientific Reports, 13