A new hybrid ensemble machine-learning model for severity risk assessment and post-COVID prediction system

被引:15
|
作者
Shakhovska, Natalya [1 ]
Yakovyna, Vitaliy [1 ,2 ]
Chopyak, Valentyna [3 ]
机构
[1] Lviv Polytech Natl Univ, Dept Artificial Intelligence, UA-79013 Lvov, Ukraine
[2] Univ Warmia & Mazury, Fac Math & Comp Sci, PL-10719 Olsztyn, Poland
[3] Danylo Halytskyi Lviv Natl Univ, Dept Clin Immunol & Allergol, UA-79010 Lvov, Ukraine
基金
新加坡国家研究基金会;
关键词
COVID-19; severity prediction; machine learning; ensemble classification; biomarkers; FEATURE-SELECTION;
D O I
10.3934/mbe.2022285
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Starting from December 2019, the COVID-19 pandemic has globally strained medical resources and caused significant mortality. It is commonly recognized that the severity of SARS-CoV-2 disease depends on both the comorbidity and the state of the patient's immune system, which is reflected in several biomarkers. The development of early diagnosis and disease severity prediction methods can reduce the burden on the health care system and increase the effectiveness of treatment and rehabilitation of patients with severe cases. This study aims to develop and validate an ensemble machine-learning model based on clinical and immunological features for severity risk assessment and post-COVID rehabilitation duration for SARS-CoV-2 patients. The dataset consisting of 35 features and 122 instances was collected from Lviv regional rehabilitation center. The dataset contains age, gender, weight, height, BMI, CAT, 6-minute walking test, pulse, external respiration function, oxygen saturation, and 15 immunological markers used to predict the relationship between disease duration and biomarkers using the machine learning approach. The predictions are assessed through an area under the receiver-operating curve, classification accuracy, precision, recall, and F1 score performance metrics. A new hybrid ensemble feature selection model for a post-COVID prediction system is proposed as an automatic feature cut-off rank identifier. A three-layer high accuracy stacking ensemble classification model for intelligent analysis of short medical datasets is presented. Together with weak predictors, the associative rules allowed improving the classification quality. The proposed ensemble allows using a random forest model as an aggregator for weak repressors' results generalization. The performance of the three-layer stacking ensemble classification model (AUC 0.978; CA 0.920; F1 score 0.921; precision 0.924; recall 0.920) was higher than five machine learning models, viz. tree algorithm with forward pruning; Naive Bayes classifier; support vector machine with RBF kernel; logistic regression, and a calibrated learner with sigmoid function and decision threshold optimization. Aging-related biomarkers, viz. CD3+, CD4+, CD8+, CD22+ were examined to predict post-COVID rehabilitation duration. The best accuracy was reached in the case of the support vector machine with the linear kernel (MAPE = 0.0787) and random forest classifier (RMSE = 1.822). The proposed three -layer stacking ensemble classification model predicted SARS-CoV-2 disease severity based on the cytokines and physiological biomarkers. The results point out that changes in studied biomarkers associated with the severity of the disease can be used to monitor the severity and forecast the rehabilitation duration.
引用
收藏
页码:6102 / 6123
页数:22
相关论文
共 50 条
  • [21] Multilayer hybrid ensemble machine learning model for analysis of Covid-19 vaccine sentiments
    Jain, Vipin
    Kashyap, Kanchan Lata
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6307 - 6319
  • [22] Multilayer hybrid ensemble machine learning model for analysis of Covid-19 vaccine sentiments
    Jain, Vipin
    Kashyap, Kanchan Lata
    Journal of Intelligent and Fuzzy Systems, 2022, 43 (05): : 6307 - 6319
  • [23] A machine-learning parsimonious multivariable predictive model of mortality risk in patients with Covid-19
    Murri, Rita
    Lenkowicz, Jacopo
    Masciocchi, Carlotta
    Iacomini, Chiara
    Fantoni, Massimo
    Damiani, Andrea
    Marchetti, Antonio
    Sergi, Paolo Domenico Angelo
    Arcuri, Giovanni
    Cesario, Alfredo
    Patarnello, Stefano
    Antonelli, Massimo
    Bellantone, Rocco
    Bernabei, Roberto
    Boccia, Stefania
    Calabresi, Paolo
    Cambieri, Andrea
    Cauda, Roberto
    Colosimo, Cesare
    Crea, Filippo
    De Maria, Ruggero
    De Stefano, Valerio
    Franceschi, Francesco
    Gasbarrini, Antonio
    Parolini, Ornella
    Richeldi, Luca
    Sanguinetti, Maurizio
    Urbani, Andrea
    Zega, Maurizio
    Scambia, Giovanni
    Valentini, Vincenzo
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [24] A hybrid machine learning model based on ensemble methods for devices fault prediction in the wood industry
    Dahesh, Arezoo
    Tavakkoli-Moghaddam, Reza
    Wassan, Niaz
    Tajally, AmirReza
    Daneshi, Zahra
    Erfani-Jazi, Aseman
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [25] Enabling personalized perioperative risk prediction by using a machine-learning model based on preoperative data
    Graessner, Martin
    Jungwirth, Bettina
    Frank, Elke
    Schaller, Stefan Josef
    Kochs, Eberhard
    Ulm, Kurt
    Blobner, Manfred
    Ulm, Bernhard
    Podtschaske, Armin Horst
    Kagerbauer, Simone Maria
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [26] A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort
    Wu, Xuangao
    Park, Sunmin
    JOURNAL OF KOREAN MEDICAL SCIENCE, 2023, 38 (21)
  • [27] A hybrid super ensemble learning model for the early-stage prediction of diabetes risk
    Ayşe Doğru
    Selim Buyrukoğlu
    Murat Arı
    Medical & Biological Engineering & Computing, 2023, 61 : 785 - 797
  • [28] Enabling personalized perioperative risk prediction by using a machine-learning model based on preoperative data
    Martin Graeßner
    Bettina Jungwirth
    Elke Frank
    Stefan Josef Schaller
    Eberhard Kochs
    Kurt Ulm
    Manfred Blobner
    Bernhard Ulm
    Armin Horst Podtschaske
    Simone Maria Kagerbauer
    Scientific Reports, 13
  • [29] A hybrid ensemble machine learning model for discharge coefficient prediction of side orifices with different shapes
    Deng, Yangyu
    Zhang, Di
    Zhang, Dong
    Wu, Jian
    Liu, Yakun
    FLOW MEASUREMENT AND INSTRUMENTATION, 2023, 91
  • [30] A hybrid super ensemble learning model for the early-stage prediction of diabetes risk
    Dogru, Ayse
    Buyrukoglu, Selim
    Ari, Murat
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (03) : 785 - 797