Statistical Analysis and Machine Learning Prediction of Disease Outcomes for COVID-19 and Pneumonia Patients

被引:10
|
作者
Zhao, Yu
Zhang, Rusen
Zhong, Yi
Wang, Jingjing
Weng, Zuquan
Luo, Heng
Chen, Cunrong
机构
[1] College of Computer and Data Science, Fuzhou University, Fuzhou
[2] Centre for Big Data Research in Burns and Trauma, Fuzhou University, Fuzhou
[3] Department of Cardiovascular Medicine, Affiliated Fuzhou First Hospital of Fujian Medical University, Fuzhou
[4] Department of Critical Care Medicine, Union Hospital of Fujian Medical University, Fuzhou
[5] College of Biological Science and Engineering, Fuzhou University, Fuzhou
[6] MetaNovas Biotech Inc, Foster City, CA
基金
中国国家自然科学基金;
关键词
the coronavirus disease 2019; pneumonia; statistical analysis; machine learning; attention mechanism; clinical indicators; COMMUNITY-ACQUIRED PNEUMONIA; TO-LYMPHOCYTE RATIO; IPRATROPIUM BROMIDE; DIAGNOSIS;
D O I
10.3389/fcimb.2022.838749
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
The Coronavirus Disease 2019 (COVID-19) has spread all over the world and impacted many people's lives. The characteristics of COVID-19 and other types of pneumonia have both similarities and differences, which confused doctors initially to separate and understand them. Here we presented a retrospective analysis for both COVID-19 and other types of pneumonia by combining the COVID-19 clinical data, eICU and MIMIC-III databases. Machine learning models, including logistic regression, random forest, XGBoost and deep learning neural networks, were developed to predict the severity of COVID-19 infections as well as the mortality of pneumonia patients in intensive care units (ICU). Statistical analysis and feature interpretation, including the analysis of two-level attention mechanisms on both temporal and non-temporal features, were utilized to understand the associations between different clinical variables and disease outcomes. For the COVID-19 data, the XGBoost model obtained the best performance on the test set (AUROC = 1.000 and AUPRC = 0.833). On the MIMIC-III and eICU pneumonia datasets, our deep learning model (Bi-LSTM_Attn) was able to identify clinical variables associated with death of pneumonia patients (AUROC = 0.924 and AUPRC = 0.802 for 24-hour observation window and 12-hour prediction window). The results highlighted clinical indicators, such as the lymphocyte counts, that may help the doctors to predict the disease progression and outcomes for both COVID-19 and other types of pneumonia.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Predicting Clinical Outcomes in COVID-19 and Pneumonia Patients: A Machine Learning Approach
    Cai, Kaida
    Wang, Zhengyan
    Yang, Xiaofang
    Fu, Wenzhi
    Zhao, Xin
    VIRUSES-BASEL, 2024, 16 (10):
  • [2] Analysis on Prediction of COVID-19 with Machine Learning Algorithms
    Sathyaraj, R.
    Kanthavel, R.
    Cavaliere, Luigi Pio Leonardo
    Vyas, Sumit
    Maheswari, S.
    Gupta, Ravi Kumar
    Raja, M. Ramkumar
    Dhaya, R.
    Gupta, Mukesh Kumar
    Sengan, Sudhakar
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2022, 30 (SUPP01) : 67 - 82
  • [3] Analysis and Prediction of COVID-19 using Machine Learning
    Parthiban, M.
    Alphy, Anna
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [4] Prediction of Disease Stage by Machine Learning Classification Methods for Covid-19 Patients
    Dogancay, Merve
    Oruc, Ozlem Ege
    Altin, Zeynep
    Sirlanci, Melike
    AUSTRIAN JOURNAL OF STATISTICS, 2024,
  • [5] COVID-19 from symptoms to prediction: A statistical and machine learning approach
    Fakieh, Bahjat
    Saleem, Farrukh
    Computers in Biology and Medicine, 2024, 182
  • [6] Statistical analysis of blood characteristics of COVID-19 patients and their survival or death prediction using machine learning algorithms
    Mazloumi, Rahil
    Abazari, Seyed Reza
    Nafarieh, Farnaz
    Aghsami, Amir
    Jolai, Fariborz
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14729 - 14743
  • [7] Statistical analysis of blood characteristics of COVID-19 patients and their survival or death prediction using machine learning algorithms
    Rahil Mazloumi
    Seyed Reza Abazari
    Farnaz Nafarieh
    Amir Aghsami
    Fariborz Jolai
    Neural Computing and Applications, 2022, 34 : 14729 - 14743
  • [8] Budget constrained machine learning for early prediction of adverse outcomes for COVID-19 patients
    Sam Nguyen
    Ryan Chan
    Jose Cadena
    Braden Soper
    Paul Kiszka
    Lucas Womack
    Mark Work
    Joan M. Duggan
    Steven T. Haller
    Jennifer A. Hanrahan
    David J. Kennedy
    Deepa Mukundan
    Priyadip Ray
    Scientific Reports, 11
  • [9] Budget constrained machine learning for early prediction of adverse outcomes for COVID-19 patients
    Sam Nguyen
    Chan, Ryan
    Cadena, Jose
    Soper, Braden
    Kiszka, Paul
    Womack, Lucas
    Work, Mark
    Duggan, Joan M.
    Haller, Steven T.
    Hanrahan, Jennifer A.
    Kennedy, David J.
    Mukundan, Deepa
    Ray, Priyadip
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [10] Machine Learning and Statistical Modelling for Prediction of Novel COVID-19 Patients Case Study: Jordan
    Fayyoumi, Ebaa
    Idwan, Sahar
    AboShindi, Heba
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 122 - 126