Diabetes prediction using feature engineering and machine learning algorithms with security

被引:0
|
作者
Arora, Jyoti [1 ]
Rathee, Sonia [2 ]
Gahlan, Mamta [1 ]
Shalu, Amita Yadav [3 ]
机构
[1] Maharaja Surajmal Inst Technol, Dept Informat & Technol, Delhi, India
[2] Maharaja Surajmal Inst Technol, Dept Comp Sci & Engn, Delhi, India
[3] Maharaja Surajmal Inst Technol, Delhi, India
关键词
Diabetes prediction; Light gradient boosting machine (LGBM); Naive Bayes (NB); Random forest; Logistic regression (LR); Gradient boosting (GB); K-nearest neighbour (k-NN); XGBoost; Decision tree (DT); Support vector machines (SVM);
D O I
10.47974/JSMS-1253
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The prevalence of diabetes has been steadily increasing, necessitating accurate prediction models to assist in early diagnosis and proactive management. In this paper, a hybrid machine learning-based diabetes prediction model has been proposed. To evaluate the model, the dataset was subsequently divided into training and testing subsets. We used the Random Forest Classifier, Light Gradient Boosting Mechanism Classifier, Gradient Boosting Classifier, Logistic Regression, K-Nearest Neighbours (KNN) Classifier, Naive Bayes Gaussian, Decision Tree Classifier, XGBoost Classifier, and Support Vector Classifier as nine different classifiers. Several metrics were used to evaluate the models, including testing accuracy, recall score, F1 score, and precision score. We have evaluated our model on the "Pima Indian Diabetes Database"[1], which served as the main dataset, for diabetes prediction. The proposed model serves as a practical framework for researchers and practitioners interested in leveraging machine learning techniques for diabetes prediction.
引用
收藏
页码:273 / 284
页数:12
相关论文
共 50 条
  • [31] Comparative analysis and prediction of nucleosome positioning using integrative feature representation and machine learning algorithms
    Han, Guo-Sheng
    Li, Qi
    Li, Ying
    [J]. BMC BIOINFORMATICS, 2021, 22 (SUPPL 6)
  • [32] Prediction of Intrinsically Disordered Proteins Using Machine Learning Algorithms Based on Fuzzy Entropy Feature
    Zhang, Lin
    Liu, Haiyuan
    He, Hao
    [J]. ALGORITHMS, 2021, 14 (04)
  • [33] Comparative analysis and prediction of quorum-sensing peptides using feature representation learning and machine learning algorithms
    Wei, Leyi
    Hu, Jie
    Li, Fuyi
    Song, Jiangning
    Su, Ran
    Zou, Quan
    [J]. BRIEFINGS IN BIOINFORMATICS, 2020, 21 (01) : 106 - 119
  • [34] Security Engineering for Machine Learning
    McGraw, Gary
    Bonett, Richie
    Figueroa, Harold
    Shepardson, Victor
    [J]. COMPUTER, 2019, 52 (08) : 54 - 57
  • [35] Diabetes Prediction using Machine Learning Techniques
    Obulesu, O.
    Suresh, K.
    Ramudu, B. Venkata
    [J]. HELIX, 2020, 10 (02): : 136 - 142
  • [36] Heart disease prediction using entropy based feature engineering and ensembling of machine learning classifiers
    Rajendran, Rajkamal
    Karthi, Anitha
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 207
  • [37] Effective Feature Engineering Technique for Heart Disease Prediction With Machine Learning
    Qadri, Azam Mehmood
    Raza, Ali
    Munir, Kashif
    Almutairi, Mubarak S.
    [J]. IEEE ACCESS, 2023, 11 : 56214 - 56224
  • [38] Feature engineering for machine learning enabled early prediction of battery lifetime
    Paulson, Noah H.
    Kubal, Joseph
    Ward, Logan
    Saxena, Saurabh
    Lu, Wenquan
    Babinec, Susan J.
    [J]. JOURNAL OF POWER SOURCES, 2022, 527
  • [39] Diagnosis and Classification of the Diabetes Using Machine Learning Algorithms
    Theerthagiri P.
    Ruby A.U.
    Vidya J.
    [J]. SN Computer Science, 4 (1)
  • [40] Multiple disease prediction using Machine learning algorithms
    Arumugam K.
    Naved M.
    Shinde P.P.
    Leiva-Chauca O.
    Huaman-Osorio A.
    Gonzales-Yanac T.
    [J]. Materials Today: Proceedings, 2023, 80 : 3682 - 3685