Stroke Dataset Modeling: Comparative Study of Machine Learning Classification Methods

被引：1

作者：

Kitova, Kalina ^{[1
]}

Ivanov, Ivan ^{[1
]}

Hooper, Vincent ^{[2
]}

机构：

[1] Sofia Univ St Kl Ohridski, Fac Econ & Business Adm, Sofia 1113, Bulgaria

[2] Dubai Int Acad City, SP Jain Sch Global Management, POB 502345, Dubai, U Arab Emirates

来源：

ALGORITHMS | 2024年 / 17卷 / 12期

关键词：

stroke prediction; machine learning modeling; classification models; imbalanced dataset;

D O I：

10.3390/a17120571

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Stroke prediction is a vital research area due to its significant implications for public health. This comparative study offers a detailed evaluation of algorithmic methodologies and outcomes from three recent prominent studies on stroke prediction. Ivanov et al. tackled issues of imbalanced datasets and algorithmic bias using deep learning techniques, achieving notable results with a 98% accuracy and a 97% recall rate. They utilized resampling methods to balance the classes and advanced imputation techniques to handle missing data, underscoring the critical role of data preprocessing in enhancing the performance of Support Vector Machines (SVMs). Hassan et al. addressed missing data and class imbalance using multiple imputations and the Synthetic Minority Oversampling Technique (SMOTE). They developed a Dense Stacking Ensemble (DSE) model with over 96% accuracy. Their results underscore the efficiency of ensemble learning techniques and imputation for handling imbalanced datasets in stroke prediction. Bathla et al. employed various classifiers and feature selection techniques, including SMOTE, for class balancing. Their Random Forest (RF) classifier, combined with Feature Importance (FI) selection, achieved an accuracy of 97.17%, illustrating the positive impact of RF and relevant feature selection on model performance. A comparative analysis indicated that Ivanov et al.'s method achieved the highest accuracy rate. However, the studies collectively highlight that the choice of models and techniques for stroke prediction should be tailored to the specific characteristics of the dataset used. This study emphasizes the importance of effective data management and model selection in enhancing predictive performance.

引用

页数：16

共 50 条

[41] A Comparative Study of Machine Learning Methods for Traffic Sign Recognition
Schuszter, Ioan Cristian
2017 19TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2017), 2017, : 389 - 392
[42] Comparative study of machine learning methods for modeling associations between risk factors and future dementia cases
Vaka Valsdóttir
María K. Jónsdóttir
Brynja Björk Magnúsdóttir
Milan Chang
Yi-Han Hu
Vilmundur Gudnason
Lenore J. Launer
Hlynur Stefánsson
GeroScience, 2024, 46 : 737 - 750
[43] Comparative study of machine learning methods for modeling associations between risk factors and future dementia cases
Valsdottir, Vaka
Jonsdottir, Maria K.
Magnusdottir, Brynja Bjoerk
Chang, Milan
Hu, Yi-Han
Gudnason, Vilmundur
Launer, Lenore J.
Stefansson, Hlynur
GEROSCIENCE, 2024, 46 (01) : 737 - 750
[44] Improving the accuracy of multiclass classification in machine learning: A case study in a cell signaling dataset
Pablo Gonzalez-Perez, Pedro
Eduardo Sanchez-Gutierrez, Maximo
INTELLIGENT DATA ANALYSIS, 2022, 26 (02) : 481 - 500
[45] A Comparative Study of Machine Learning Methods for Computational Modeling of the Selective Laser Melting Additive Manufacturing Process
Chaudhry, Shubham
Soulaimani, Azzeddine
APPLIED SCIENCES-BASEL, 2022, 12 (05):
[46] A Comparative Study of Local Net Modeling Using Machine Learning
Melchert, Jackson
Zhang, Boyu
Davoodi, Azadeh
PROCEEDINGS OF THE 2018 GREAT LAKES SYMPOSIUM ON VLSI (GLSVLSI'18), 2018, : 273 - 278
[47] Comparative Study of different Lazy Learning Associative Classification Methods
Tamrakar, Preeti
Ibrahim, Syed S. P.
2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 370 - 376
[48] An Exploratory Study in Classification Methods for Patients' Dataset
Mutalib, Sofianita
Ali, Nor Azlin
Rahman, Shuzlina Abdul
Mohamed, Azlinah
2009 2ND CONFERENCE ON DATA MINING AND OPTIMIZATION, 2009, : 86 - 90
[49] Classification of Intrusion Detection Dataset using machine learning Approaches
Subramanyam, Doodipalli
PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES, ELECTRONICS AND MECHANICAL SYSTEMS (CTEMS), 2018, : 280 - 283
[50] A Comparative Study on the Ship Classification Performance of the Deep Learning Model According to Dataset Difference
Moon, SungWon
Kim, YoonHyung
Nam, Dowon
Yoo, Wonyoung
Kim, Changick
2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1428 - 1430

← 1 2 3 4 5 →