Machine Learning Techniques for Diabetes Mellitus Based on Lifestyle Predictors

被引:0
|
作者
Ansari, Gufran Ahmad [1 ]
Bhat, Salliah Shafi [2 ]
Ansari, Mohd Dilshad [3 ]
机构
[1] Univ Hail, Dept Comp Sci & Software Engn, Hail, Saudi Arabia
[2] BS Abdur Rahman Crescent Inst Sci & Technol, Chennai 48, India
[3] SRM Univ Delhi NCR, Sonepat, Haryana, India
关键词
Machine learning; support vector classification; decision tree classification; K-Nearest classification; logistic regression; diabetes mellitus; lifestyle predictors;
D O I
10.2174/0123520965291435240508111712
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Background Diabetes has been rising in recent years and prior research has demonstrated Machine Learning Techniques (MLTs) to be useful tools for predicting diabetes. This research has examined the accuracy of six different MLTs for predicting diabetes using lifestyle data gathered from UCI (University of California). To improve medical outcomes and prevent its onset, the prediction of diabetes is necessary. This research has proposed a new framework based on the early detection of diabetes using lifestyle factors. Various MLTs, such as Logistic Regression (LR), Decision Tree Classification (DTC), Random Forest Classification (RFC), Support Vector Classification (SVC), and K-Nearest Classification (KNC) have been used for tenfold cross-validation and the results obtained from different techniques have been verified. Among all classification techniques, LR has achieved the highest accuracy of 93%, the precision of 92%, the recall score of 94%, the F1 score of 93%, and the weighted average of 90%, respectively. The proposed framework is utilized by the healthcare sector to predict diabetes early. It can also be used with datasets from various sectors that share diabetes-related data.Methods In this paper, we have used the proposed framework to predict diabetes mellitus in the healthcare system, diagnose various ailments, and assess if MLA performs well. The proposed system has been developed based on the MLT for the classification of DM. An intelligent framework for Diabetes Mellitus (DM) that has been developed using MLT illustrates the full workflow from data input to output. The five algorithms, Logistic Regression (LR), Decision Tree Classification (DTC), Random Forest Classification (RFC), Support Vector Classification (SVC), and K-Nearest Classification (KNC), have been compared in terms of accuracy, precision, recall, and F1 score.Results Results from the experimental setting using MLTs for DM prediction based on lifestyle predictors have been obtained. Descriptive statistics of lifestyle characteristics have been displayed along with their corresponding metrics, such as mean, standard deviation, minimum, maximum, etc. For instance, the age parameters' mean, standard, and minimum at 25%, 50%, 75%, and maximum values were as follows: 520.0, 48.02, 12.151, 16.0, 39.0, 47.5, 57.0, and 90.0 respectively, as shown in Fig. (10). Feature engineering is crucial to the process of constructing MLT. Insignificant or incorrect characteristics may have a negative impact on the way a model runs. The training time is drastically reduced and accuracy is increased with careful feature selection. In machine learning frameworks, some feature selection strategies include embedding, filter, wrapper, embedded, and hybrid techniques. An alarming number of people around the world suffer from the chronic and dangerous disease of diabetes. Using MLT, early DM prediction-based biological variables have been obtained in this research work. Data on patients' lifestyles have been thoroughly examined in order to create a framework. The Canonical-correlation Analysis (CCA) has been used to select the ideal combination of lifestyle features. Finally, 10-fold cross-validations have been used to apply five alternative machine learning techniques for the prediction of disease.Conclusion To our knowledge, it is the first time a framework has been proposed that has yielded prediction results so much better than those from earlier research. The results obtained in this suggested work have been found accurate and reliable by metrics evaluation.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Predicting Diabetes Mellitus With Machine Learning Techniques
    Zou, Quan
    Qu, Kaiyang
    Luo, Yamei
    Yin, Dehui
    Ju, Ying
    Tang, Hua
    [J]. FRONTIERS IN GENETICS, 2018, 9
  • [2] Analysis of Diabetes mellitus using Machine Learning Techniques
    Bhat, Salliah Shafi
    Selvam, Venkatesan
    Ansari, Gufran Ahmad
    Ansari, Mohd Dilshad
    [J]. 2022 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2022,
  • [3] Predictive models for diabetes mellitus using machine learning techniques
    Lai, Hang
    Huang, Huaxiong
    Keshavjee, Karim
    Guergachi, Aziz
    Gao, Xin
    [J]. BMC ENDOCRINE DISORDERS, 2019, 19 (01)
  • [4] Predictive models for diabetes mellitus using machine learning techniques
    Hang Lai
    Huaxiong Huang
    Karim Keshavjee
    Aziz Guergachi
    Xin Gao
    [J]. BMC Endocrine Disorders, 19
  • [5] Metabolic Syndrome and Development of Diabetes Mellitus: Predictive Modeling Based on Machine Learning Techniques
    Perveen, Sajida
    Shahbaz, Muhammad
    Keshavjee, Karim
    Guergachi, Aziz
    [J]. IEEE ACCESS, 2019, 7 : 1365 - 1375
  • [6] Evaluation of predisposing factors of Diabetes Mellitus post Gestational Diabetes Mellitus using Machine Learning Techniques
    Krishnan, Devi R.
    Menakath, Gayathri P.
    Radhakrishnan, Anagha
    Himavarshini, Yarrangangu
    Aparna, A.
    Mukundan, Kaveri
    Pathinarupothi, Rahul Krishnan
    Alangot, Bithin
    Mahankali, Sirisha
    Maddipati, Chakravarthy
    [J]. 2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 81 - 85
  • [7] DIAGNOSIS OF DIABETES MELLITUS USING MACHINE LEARNING TECHNIQUES FOR EFFICIENT REVIEW
    Thiyagarajan, C.
    Vaideghy, A.
    Sridevi, V
    [J]. INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (02) : 4184 - 4187
  • [8] A COMPREHENSIVE ANALYSIS OF MACHINE LEARNING TECHNIQUES FOR INCESSANT PREDICTION OF DIABETES MELLITUS
    Reddy, Shiva Shankar
    Sethi, Nilambar
    Rajender, R.
    [J]. INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2020, 13 (01): : 1 - 22
  • [9] Decision Support System for Diabetes Mellitus through Machine Learning Techniques
    Rashid, Tarik A.
    Abdulla, Saman. M.
    Abdulla, Rezhna. M.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (07) : 170 - 178
  • [10] Performance analysis and prediction of type 2 diabetes mellitus based on lifestyle data using machine learning approaches
    Ganie, Shahid Mohammad
    Malik, Majid Bashir
    Arif, Tasleem
    [J]. JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2022, 21 (01) : 339 - 352