Stroke Dataset Modeling: Comparative Study of Machine Learning Classification Methods

被引:1
|
作者
Kitova, Kalina [1 ]
Ivanov, Ivan [1 ]
Hooper, Vincent [2 ]
机构
[1] Sofia Univ St Kl Ohridski, Fac Econ & Business Adm, Sofia 1113, Bulgaria
[2] Dubai Int Acad City, SP Jain Sch Global Management, POB 502345, Dubai, U Arab Emirates
关键词
stroke prediction; machine learning modeling; classification models; imbalanced dataset;
D O I
10.3390/a17120571
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stroke prediction is a vital research area due to its significant implications for public health. This comparative study offers a detailed evaluation of algorithmic methodologies and outcomes from three recent prominent studies on stroke prediction. Ivanov et al. tackled issues of imbalanced datasets and algorithmic bias using deep learning techniques, achieving notable results with a 98% accuracy and a 97% recall rate. They utilized resampling methods to balance the classes and advanced imputation techniques to handle missing data, underscoring the critical role of data preprocessing in enhancing the performance of Support Vector Machines (SVMs). Hassan et al. addressed missing data and class imbalance using multiple imputations and the Synthetic Minority Oversampling Technique (SMOTE). They developed a Dense Stacking Ensemble (DSE) model with over 96% accuracy. Their results underscore the efficiency of ensemble learning techniques and imputation for handling imbalanced datasets in stroke prediction. Bathla et al. employed various classifiers and feature selection techniques, including SMOTE, for class balancing. Their Random Forest (RF) classifier, combined with Feature Importance (FI) selection, achieved an accuracy of 97.17%, illustrating the positive impact of RF and relevant feature selection on model performance. A comparative analysis indicated that Ivanov et al.'s method achieved the highest accuracy rate. However, the studies collectively highlight that the choice of models and techniques for stroke prediction should be tailored to the specific characteristics of the dataset used. This study emphasizes the importance of effective data management and model selection in enhancing predictive performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Comparative Study of Machine Learning Modeling for Unsteady Aerodynamics
    Alkhedher, Mohammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 1901 - 1920
  • [22] Machine learning for Gravity Spy: Glitch classification and dataset
    Bahaadini, S.
    Noroozi, V.
    Rohani, N.
    Coughlin, S.
    Zevin, M.
    Smith, J. R.
    Kalogera, V.
    Katsaggelos, A.
    INFORMATION SCIENCES, 2018, 444 : 172 - 186
  • [23] Comparative Study of Machine Learning Algorithms for Glacier facies Classification
    Panwar, Ruby
    Kumar, Amit
    Singh, Gulab
    2024 IEEE Mediterranean and Middle-East Geoscience and Remote Sensing Symposium, M2GARSS 2024 - Proceedings, 2024, : 296 - 299
  • [24] Classification of Diseases Using Machine Learning Algorithms: A Comparative Study
    Moreno-Ibarra, Marco-Antonio
    Villuendas-Rey, Yenny
    Lytras, Miltiadis D.
    Yanez-Marquez, Cornelio
    Salgado-Ramirez, Julio-Cesar
    MATHEMATICS, 2021, 9 (15)
  • [25] A comparative study of machine learning algorithms for physiological signal classification
    Biagetti, Giorgio
    Crippa, Paolo
    Falaschetti, Laura
    Tanoni, Giulia
    Turchetti, Claudio
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 1977 - 1984
  • [26] Machine Learning Algorithms for Privacy Policy Classification: A Comparative Study
    Alshamsan, Abdullah R.
    Chaudhry, Shafique A.
    2022 2ND IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND ARTIFICIAL INTELLIGENCE (SEAI 2022), 2022, : 214 - 219
  • [27] Classification of Heart Failure Using Machine Learning: A Comparative Study
    Chulde-Fernandez, Bryan
    Enriquez-Ortega, Denisse
    Guevara, Cesar
    Navas, Paulo
    Tirado-Espin, Andres
    Vizcaino-Imacana, Paulina
    Villalba-Meneses, Fernando
    Cadena-Morejon, Carolina
    Almeida-Galarraga, Diego
    Acosta-Vargas, Patricia
    LIFE-BASEL, 2025, 15 (03):
  • [28] A Comparative Study of the Method to Rapid Identification of the Mural Pigments by Combining LIBS-Based Dataset and Machine Learning Methods
    Sun, Duixiong
    Zhang, Yiming
    Yin, Yaopeng
    Zhang, Zhao
    Qian, Hengli
    Wang, Yarui
    Yu, Zongren
    Su, Bomin
    Dong, Chenzhong
    Su, Maogen
    CHEMOSENSORS, 2022, 10 (10)
  • [29] Classification of Hatchery Eggs Using a Machine Learning Algorithm Based on Image Processing Methods: A Comparative Study
    A, Celik
    Tekin, E.
    BRAZILIAN JOURNAL OF POULTRY SCIENCE, 2024, 26 (02)
  • [30] Sugarcane leaf dataset: A dataset for disease detection and classification for machine learning applications
    Thite, Sandip
    Suryawanshi, Yogesh
    Patil, Kailas
    Chumchu, Prawit
    DATA IN BRIEF, 2024, 53