Random forest-based prediction of stroke outcome

被引:40
|
作者
Fernandez-Lozano, Carlos [1 ,2 ]
Hervella, Pablo [3 ]
Mato-Abad, Virginia [4 ]
Rodriguez-Yanez, Manuel [5 ]
Suarez-Garaboa, Sonia [4 ]
Lopez-Dequidt, Iria [5 ]
Estany-Gestal, Ana [6 ]
Sobrino, Tomas [3 ]
Campos, Francisco [3 ]
Castillo, Jose [3 ]
Rodriguez-Yanez, Santiago [4 ]
Iglesias-Rey, Ramon [3 ]
机构
[1] Univ A Coruna, Dept Comp Sci & Informat Technol, CITIC Res Ctr Informat & Commun Technol, Fac Comp Sci, La Coruna, Spain
[2] Univ A Coruna, Grp Redes Neuronas Artificiales & Sistemas Adapta, Inst Invest Biomed A Coruna INIBIC, Complexo Hosp Univ A Coruna CHUAC,SERGAS, La Coruna, Spain
[3] Hlth Res Inst Santiago de Compostela IDIS, Clin Neurosci Res Lab LINC, Santiago De Compostela, Spain
[4] Univ A Coruna, Dept Comp Sci & Informat Technol, Software Engn Lab, Fac Comp Sci, Campus Elvina, La Coruna 15071, Spain
[5] Hosp Clin Univ, Hlth Res Inst Santiago de Compostela IDIS, Dept Neurol, Stroke Unit, Rua Travesa da Choupana S-N, Santiago De Compostela 15706, Spain
[6] Hlth Res Inst Santiago de Compostela IDIS, Unit Methodol Res, Santiago De Compostela, Spain
关键词
ACUTE ISCHEMIC-STROKE; LESION SEGMENTATION; FEATURE-SELECTION; TRENDS; CLASSIFICATION;
D O I
10.1038/s41598-021-89434-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We research into the clinical, biochemical and neuroimaging factors associated with the outcome of stroke patients to generate a predictive model using machine learning techniques for prediction of mortality and morbidity 3-months after admission. The dataset consisted of patients with ischemic stroke (IS) and non-traumatic intracerebral hemorrhage (ICH) admitted to Stroke Unit of a European Tertiary Hospital prospectively registered. We identified the main variables for machine learning Random Forest (RF), generating a predictive model that can estimate patient mortality/morbidity according to the following groups: (1) IS+ICH, (2) IS, and (3) ICH. A total of 6022 patients were included: 4922 (mean age 71.9 +/- 13.8 years) with IS and 1100 (mean age 73.3 +/- 13.1 years) with ICH. NIHSS at 24, 48 h and axillary temperature at admission were the most important variables to consider for evolution of patients at 3-months. IS+ICH group was the most stable for mortality prediction [0.904 +/- 0.025 of area under the receiver operating characteristics curve (AUC)]. IS group presented similar results, although variability between experiments was slightly higher (0.909 +/- 0.032 of AUC). ICH group was the one in which RF had more problems to make adequate predictions (0.9837 vs. 0.7104 of AUC). There were no major differences between IS and IS+ICH groups according to morbidity prediction (0.738 and 0.755 of AUC) but, after checking normality with a Shapiro Wilk test with the null hypothesis that the data follow a normal distribution, it was rejected with W=0.93546 (p-value<2.2e-16). Conditions required for a parametric test do not hold, and we performed a paired Wilcoxon Test assuming the null hypothesis that all the groups have the same performance. The null hypothesis was rejected with a value<2.2e-16, so there are statistical differences between IS and ICH groups. In conclusion, machine learning algorithms RF can be effectively used in stroke patients for long-term outcome prediction of mortality and morbidity.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Random Forest-Based Prediction Models for Assessing Cardiovascular Disease Risk: Integrating Clinical and Genetic Factors
    Swarnkar, Suman Kumar
    Parthiban, M.
    Dixit, Rohit R.
    Kalavadekar, Prakash
    Patil, Sarika B.
    Ambhaikar, Asha
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 663 - 672
  • [42] SFCscoreRF: A Random Forest-Based Scoring Function for Improved Affinity Prediction of Protein-Ligand Complexes
    Zilian, David
    Sotriffer, Christoph A.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (08) : 1923 - 1933
  • [43] Random forest-based robust classification for lithographic hotspot detection
    Dewar, Rohit
    Barai, Samit
    Kumar, Pardeep
    Srinivasan, Babji
    Mohapatra, Nihar R.
    JOURNAL OF MICRO-NANOLITHOGRAPHY MEMS AND MOEMS, 2019, 18 (02):
  • [44] Random Forest-based Approach for Classifying Customers in Social CRM
    Lamrhari, Soumaya
    Elghazi, Hamid
    El Faker, Abdellatif
    2020 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGY MANAGEMENT, OPERATIONS AND DECISIONS (ICTMOD), 2020,
  • [45] A Random Forest-Based Leaf Classification Using Multiple Features
    Hazra, Dipankar
    Bhattacharyya, Debnath
    Kim, Tai-hoon
    Advances in Intelligent Systems and Computing, 2021, 1280 : 227 - 239
  • [46] Random Forest-based Thermal Effect Prediction for Clock Tree Synthesis in 3D-IC
    Jin, Myeongwoo
    Oh, Deokkeun
    Kim, Juho
    JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2023, 23 (03) : 149 - 161
  • [47] Improving atmospheric corrosion prediction through key environmental factor identification by random forest-based model
    Zhi, Yuanjie
    Jin, Zhihui
    Lu, Lin
    Yang, Tao
    Zhou, Deyun
    Pei, Zibo
    Wu, Dequan
    Fu, Dongmei
    Zhang, Dawei
    Li, Xiaogang
    CORROSION SCIENCE, 2021, 178
  • [48] A random forest-based model for crypto asset forecasts in futures markets with out-of-sample prediction
    Orte, Francisco
    Mira, Jose
    Jesus Sanchez, Maria
    Solana, Pablo
    RESEARCH IN INTERNATIONAL BUSINESS AND FINANCE, 2023, 64
  • [49] A random forest-based model for the prediction of construction-stage carbon emissions at the early design stage
    Fang, Yuan
    Lu, Xiaoqing
    Li, Hongyang
    Li, Hongyang (lihy@hhu.edu.cn), 1600, Elsevier Ltd (328):
  • [50] An Alternative to Laboratory Testing: Random Forest-Based Water Quality Prediction Framework for Inland and Nearshore Water Bodies
    Xu, Jianlong
    Xu, Zhuo
    Kuang, Jianjun
    Lin, Che
    Xiao, Lianghong
    Huang, Xingshan
    Zhang, Yufeng
    WATER, 2021, 13 (22)