Random forest-based prediction of stroke outcome

被引:40
|
作者
Fernandez-Lozano, Carlos [1 ,2 ]
Hervella, Pablo [3 ]
Mato-Abad, Virginia [4 ]
Rodriguez-Yanez, Manuel [5 ]
Suarez-Garaboa, Sonia [4 ]
Lopez-Dequidt, Iria [5 ]
Estany-Gestal, Ana [6 ]
Sobrino, Tomas [3 ]
Campos, Francisco [3 ]
Castillo, Jose [3 ]
Rodriguez-Yanez, Santiago [4 ]
Iglesias-Rey, Ramon [3 ]
机构
[1] Univ A Coruna, Dept Comp Sci & Informat Technol, CITIC Res Ctr Informat & Commun Technol, Fac Comp Sci, La Coruna, Spain
[2] Univ A Coruna, Grp Redes Neuronas Artificiales & Sistemas Adapta, Inst Invest Biomed A Coruna INIBIC, Complexo Hosp Univ A Coruna CHUAC,SERGAS, La Coruna, Spain
[3] Hlth Res Inst Santiago de Compostela IDIS, Clin Neurosci Res Lab LINC, Santiago De Compostela, Spain
[4] Univ A Coruna, Dept Comp Sci & Informat Technol, Software Engn Lab, Fac Comp Sci, Campus Elvina, La Coruna 15071, Spain
[5] Hosp Clin Univ, Hlth Res Inst Santiago de Compostela IDIS, Dept Neurol, Stroke Unit, Rua Travesa da Choupana S-N, Santiago De Compostela 15706, Spain
[6] Hlth Res Inst Santiago de Compostela IDIS, Unit Methodol Res, Santiago De Compostela, Spain
关键词
ACUTE ISCHEMIC-STROKE; LESION SEGMENTATION; FEATURE-SELECTION; TRENDS; CLASSIFICATION;
D O I
10.1038/s41598-021-89434-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS+ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 +/- 13.8 years) with IS and 1100 (mean age 73.3 +/- 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS+ICH group was the most stable for mortality prediction [0.904 +/- 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 +/- 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS+ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W=0.93546 (p-value<2.2e-16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value<2.2e-16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A Random Forest-Based Method for Predicting Borehole Trajectories
    Yan, Baoyong
    Zhang, Xiantao
    Tang, Chengxu
    Wang, Xiao
    Yang, Yifei
    Xu, Weihua
    MATHEMATICS, 2023, 11 (06)
  • [22] Random Forest-Based Prediction of Acute Respiratory Distress Syndrome in Patients Undergoing Cardiac Surgery
    Wang, Wei
    Li, Lina
    Gu, Hongjun
    Chen, Yanqing
    Zhen, Yumei
    Dong, Zhaorui
    HEART SURGERY FORUM, 2022, 25 (06): : E854 - E859
  • [23] Random Forest-based feature selection for emotion recognition
    Gharsalli, Sonia
    Emile, Bruno
    Laurent, Helene
    Desquesnes, Xavier
    Vivet, Damien
    5TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, THEORY, TOOLS AND APPLICATIONS 2015, 2015, : 268 - 272
  • [24] A Random Forest-Based Accuracy Prediction Model for Augmented Biofeedback in a Precision Shooting Training System
    Guo, Junqi
    Yang, Lan
    Umek, Anton
    Bie, Rongfang
    Tomazic, Saso
    Kos, Anton
    SENSORS, 2020, 20 (16) : 1 - 16
  • [25] tRForest: a novel random forest-based algorithm for tRNA-derived fragment target prediction
    Parikh, Rohan
    Wilson, Briana
    Marrah, Laine
    Su, Zhangli
    Saha, Shekhar
    Kumar, Pankaj
    Huang, Fenix
    Dutta, Anindya
    NAR GENOMICS AND BIOINFORMATICS, 2022, 4 (02)
  • [26] Prediction of cell penetrating peptides and their uptake efficiency using random forest-based feature selections
    Liu, Peng
    Ding, Yijie
    Rong, Ying
    Chen, Dong
    AICHE JOURNAL, 2022, 68 (09)
  • [27] Robustness of Random Forest-based gene selection methods
    Miron Bartosz Kursa
    BMC Bioinformatics, 15
  • [28] Random Forest-Based Prediction Model for Stiffness Degradation of Offshore Wind Farm Submarine Soil
    He, Ben
    Lin, Mingbao
    Yu, Xinran
    Zhang, Zhishuai
    Dai, Song
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2025, 13 (01)
  • [29] Improving Landslides Prediction: Meteorological Data Preprocessing Using Random Forest-Based Feature Selection
    Guerrero Rodriguez, Byron
    Salvador Meneses, Jaime
    Garcia-Rodriguez, Jose
    16TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2021), 2022, 1401 : 379 - 387
  • [30] Random Forest-Based Prediction of Enantioselectivity in Thiol Addition to Imines Catalyzed by Chiral Phosphoric Acids
    Yu, Xinliang
    Zhang, Zekai
    CHEMISTRYSELECT, 2024, 9 (40):