Stroke Dataset Modeling: Comparative Study of Machine Learning Classification Methods

Cited by: 1
Authors
Kitova, Kalina [1]
Ivanov, Ivan [1]
Hooper, Vincent [2]
Affiliations
[1] Sofia Univ St Kl Ohridski, Fac Econ & Business Adm, Sofia 1113, Bulgaria
[2] Dubai Int Acad City, SP Jain Sch Global Management, POB 502345, Dubai, U Arab Emirates
Keywords
stroke prediction; machine learning modeling; classification models; imbalanced dataset
DOI
10.3390/a17120571
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Stroke prediction is a vital research area due to its significant implications for public health. This comparative study offers a detailed evaluation of the algorithmic methodologies and outcomes of three recent prominent studies on stroke prediction. Ivanov et al. tackled issues of imbalanced datasets and algorithmic bias using deep learning techniques, achieving 98% accuracy and 97% recall. They used resampling methods to balance the classes and advanced imputation techniques to handle missing data, underscoring the critical role of data preprocessing in enhancing the performance of Support Vector Machines (SVMs). Hassan et al. addressed missing data and class imbalance using multiple imputation and the Synthetic Minority Oversampling Technique (SMOTE). They developed a Dense Stacking Ensemble (DSE) model with over 96% accuracy. Their results underscore the efficiency of ensemble learning and imputation for handling imbalanced datasets in stroke prediction. Bathla et al. employed various classifiers and feature selection techniques, and used SMOTE for class balancing. Their Random Forest (RF) classifier, combined with Feature Importance (FI) selection, achieved an accuracy of 97.17%, illustrating the positive impact of RF and relevant feature selection on model performance. A comparative analysis indicated that Ivanov et al.'s method achieved the highest accuracy. However, the studies collectively highlight that the choice of models and techniques for stroke prediction should be tailored to the specific characteristics of the dataset used. This study emphasizes the importance of effective data management and model selection in enhancing predictive performance.
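As an illustration of the class-balancing step mentioned in the abstract, the following is a minimal sketch of the interpolation idea behind SMOTE, written in plain Python. It is not the code used by any of the cited studies. The function name `smote`, its parameters, and the toy feature values are assumptions made for this example; a production workflow would more likely rely on an established library implementation.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic minority rows (hypothetical helper).

    Each synthetic row lies on the line segment between a randomly chosen
    minority row and one of its k nearest minority neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest minority neighbours of `base`, excluding itself.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: math.dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Made-up toy data: 3 minority rows against 9 majority rows, two features each.
minority = [[67.0, 228.7], [80.0, 105.9], [74.0, 70.1]]
majority_count = 9
new_rows = smote(minority, n_new=majority_count - len(minority), k=2)
balanced_minority = minority + new_rows
print(len(balanced_minority))  # minority class now matches the majority count
```

Because the synthetic rows are interpolated rather than duplicated, oversampling of this kind is normally applied to the training split only, so that no synthetic information leaks into the evaluation data.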
Pages: 16
Related papers
50 in total
  • [41] A Comparative Study of Machine Learning Methods for Traffic Sign Recognition
    Schuszter, Ioan Cristian
    2017 19TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2017), 2017, : 389 - 392
  • [42] Comparative study of machine learning methods for modeling associations between risk factors and future dementia cases
    Vaka Valsdóttir
    María K. Jónsdóttir
    Brynja Björk Magnúsdóttir
    Milan Chang
    Yi-Han Hu
    Vilmundur Gudnason
    Lenore J. Launer
    Hlynur Stefánsson
    GeroScience, 2024, 46 : 737 - 750
  • [43] Comparative study of machine learning methods for modeling associations between risk factors and future dementia cases
    Valsdottir, Vaka
    Jonsdottir, Maria K.
    Magnusdottir, Brynja Bjoerk
    Chang, Milan
    Hu, Yi-Han
    Gudnason, Vilmundur
    Launer, Lenore J.
    Stefansson, Hlynur
    GEROSCIENCE, 2024, 46 (01) : 737 - 750
  • [44] Improving the accuracy of multiclass classification in machine learning: A case study in a cell signaling dataset
    Pablo Gonzalez-Perez, Pedro
    Eduardo Sanchez-Gutierrez, Maximo
    INTELLIGENT DATA ANALYSIS, 2022, 26 (02) : 481 - 500
  • [45] A Comparative Study of Machine Learning Methods for Computational Modeling of the Selective Laser Melting Additive Manufacturing Process
    Chaudhry, Shubham
    Soulaimani, Azzeddine
    APPLIED SCIENCES-BASEL, 2022, 12 (05):
  • [46] A Comparative Study of Local Net Modeling Using Machine Learning
    Melchert, Jackson
    Zhang, Boyu
    Davoodi, Azadeh
    PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18), 2018, : 273 - 278
  • [47] Comparative Study of different Lazy Learning Associative Classification Methods
    Tamrakar, Preeti
    Ibrahim, Syed S. P.
    2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 370 - 376
  • [48] An Exploratory Study in Classification Methods for Patients' Dataset
    Mutalib, Sofianita
    Ali, Nor Azlin
    Rahman, Shuzlina Abdul
    Mohamed, Azlinah
    2009 2ND CONFERENCE ON DATA MINING AND OPTIMIZATION, 2009, : 86 - 90
  • [49] Classification of Intrusion Detection Dataset using machine learning Approaches
    Subramanyam, Doodipalli
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 280 - 283
  • [50] A Comparative Study on the Ship Classification Performance of the Deep Learning Model According to Dataset Difference
    Moon, SungWon
    Kim, YoonHyung
    Nam, Dowon
    Yoo, Wonyoung
    Kim, Changick
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1428 - 1430