Enhanced SARS-CoV-2 case prediction using public health data and machine learning models

被引:0
|
作者
Price, Bradley S. [1 ,2 ,8 ]
Khodaverdi, Maryam [2 ]
Hendricks, Brian [2 ,3 ]
Smith, Gordon S. [2 ,3 ]
Kimble, Wes [2 ]
Halasz, Adam [4 ]
Guthrie, Sara [5 ]
Fraustino, Julia D. [6 ]
Hodder, Sally L. [2 ,7 ]
机构
[1] West Virginia Univ, Dept Management Informat Syst, Morgantown, WV 26505 USA
[2] West Virginia Clin & Translat Sci Inst, Morgantown, WV 26506 USA
[3] West Virginia Univ, Dept Epidemiol & Biostat, Morgantown, WV 26505 USA
[4] West Virginia Univ, Sch Math & Data Sci, Morgantown, WV 26506 USA
[5] West Virginia Univ, Dept Sociol & Anthropol, Morgantown, WV 26505 USA
[6] West Virginia Univ, Reed Coll Media, Dept Strateg Commun, Morgantown, WV 26505 USA
[7] West Virginia Univ, Dept Med, Morgantown, WV 26506 USA
[8] West Virginia Univ, Dept Management Informat Syst, 83 Beechurst Ave, Morgantown, WV 26505 USA
关键词
public health data; machine learning; SARS-CoV-2; prediction; NEURAL-NETWORKS; DROPOUT; IMPACT;
D O I
10.1093/jamiaopen/ooae014
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objectives The goal of this study is to propose and test a scalable framework for machine learning (ML) algorithms to predict near-term severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) cases by incorporating and evaluating the impact of real-time dynamic public health data.Materials and Methods Data used in this study include patient-level results, procurement, and location information of all SARS-CoV-2 tests reported in West Virginia as part of their mandatory reporting system from January 2021 to March 2022. We propose a method for incorporating and comparing widely available public health metrics inside of a ML framework, specifically a long-short-term memory network, to forecast SARS-CoV-2 cases across various feature sets.Results Our approach provides better prediction of localized case counts and indicates the impact of the dynamic elements of the pandemic on predictions, such as the influence of the mixture of viral variants in the population and variable testing and vaccination rates during various eras of the pandemic.Discussion Utilizing real-time public health metrics, including estimated Rt from multiple SARS-CoV-2 variants, vaccination rates, and testing information, provided a significant increase in the accuracy of the model during the Omicron and Delta period, thus providing more precise forecasting of daily case counts at the county level. This work provides insights on the influence of various features on predictive performance in rural and non-rural areas.Conclusion Our proposed framework incorporates available public health metrics with operational data on the impact of testing, vaccination, and current viral variant mixtures in the population to provide a foundation for combining dynamic public health metrics and ML models to deliver forecasting and insights in healthcare domains. It also shows the importance of developing and deploying ML frameworks in rural settings. This study aims to propose and test a scalable framework for machine learning (ML) algorithms to predict near-term severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) cases by county by incorporating and evaluating the impact of real-time dynamic public health data. Data used in this study include patient-level results, procurement, and location information of all SARS-CoV-2 tests reported in West Virginia as part of their mandatory reporting system from January 2021 to March 2022. We propose a method for incorporating and comparing widely available public health metrics inside of a ML framework, specifically a long-short-term memory network, to forecast SARS-CoV-2 cases across various feature sets. Our approach provides better prediction of localized case counts, recommendation of locations of outbreaks, and indicates the impact of the dynamic elements of the pandemic on predictions, such as the influence of the mixture of viral variants in the population and variable testing and vaccination rates during various eras of the pandemic. Incorporating available public health metrics with operational data on the impact of testing, vaccination, and current viral variant mixtures in the population provides a foundation for combining dynamic public health metrics and ML models to deliver improved forecasting and insights in healthcare domains. This approach provides a model for utilizing ML to forecast, deploy, and understand the impact of public health data during coronavirus disease and other pandemics.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Prediction of antigenic peptides of SARS-CoV-2 pathogen using machine learning
    Bukhari, Syed Nisar Hussain
    Ogudo, Kingsley A.
    [J]. PEERJ COMPUTER SCIENCE, 2024, 10
  • [2] Feature selection for effective prediction of SARS-COV-2 using machine learning
    Punacha, Gagan
    Adiga, Rama
    [J]. GENES & GENOMICS, 2024, 46 (01) : 95 - 112
  • [3] Feature selection for effective prediction of SARS-COV-2 using machine learning
    Gagan Punacha
    Rama Adiga
    [J]. Genes & Genomics, 2024, 46 : 341 - 354
  • [4] Machine Learning Models Identify Inhibitors of SARS-CoV-2
    Gawriljuk, Victor O.
    Zin, Phyo Phyo Kyaw
    Puhl, Ana C.
    Zorn, Kimberley M.
    Foil, Daniel H.
    Lane, Thomas R.
    Hurst, Brett
    Tavella, Tatyana Almeida
    Maranhao Costa, Fabio Trindade
    Lakshmanane, Premkumar
    Bernatchez, Jean
    Godoy, Andre S.
    Oliva, Glaucius
    Siqueira-Neto, Jair L.
    Madrid, Peter B.
    Ekins, Sean
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (09) : 4224 - 4235
  • [6] Classification of SARS-CoV-2 and non-SARS-CoV-2 using machine learning algorithms
    Singh, Om Prakash
    Vallejo, Marta
    El-Badawy, Ismail M.
    Aysha, Ali
    Madhanagopal, Jagannathan
    Faudzi, Ahmad Athif Mohd
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 136
  • [7] Effect of Neurological Manifestations on SARS-CoV-2 Infection Prognosis using Machine Learning Models
    Thepmankorn, Parisorn
    Heshmati, Keyvan
    Souayah, Sami
    Shafiq, Basit
    Adam, Tarek
    Adam, Nabil
    Souayah, Nizar
    [J]. NEUROLOGY, 2021, 96 (15)
  • [8] Machine Learning for Prediction of Patients on Hemodialysis with an Undetected SARS-CoV-2 Infection
    Monaghan, Caitlin K.
    Larkin, John W.
    Chaudhuri, Sheetal
    Han, Hao
    Jiao, Yue
    Bermudez, Kristine M.
    Weinhandl, Eric D.
    Dahne-Steuber, Ines A.
    Belmonte, Kathleen
    Neri, Luca
    Kotanko, Peter
    Kooman, Jeroen P.
    Hymes, Jeffrey L.
    Kossmann, Robert J.
    Usvyat, Len A.
    Maddux, Franklin W.
    [J]. KIDNEY360, 2021, 2 (03): : 456 - 468
  • [9] A review of SARS-CoV-2 drug repurposing: databases and machine learning models
    Elkashlan, Marim
    Ahmad, Rahaf M.
    Hajar, Malak
    Al Jasmi, Fatma
    Corchado, Juan Manuel
    Nasarudin, Nurul Athirah
    Mohamad, Mohd Saberi
    [J]. FRONTIERS IN PHARMACOLOGY, 2023, 14
  • [10] Machine learning application for the prediction of SARS-CoV-2 infection using blood tests and chest radiograph
    Du, Richard
    Tsougenis, Efstratios D.
    Ho, Joshua W. K.
    Chan, Joyce K. Y.
    Chiu, Keith W. H.
    Fang, Benjamin X. H.
    Ng, Ming Yen
    Leung, Siu-Ting
    Lo, Christine S. Y.
    Wong, Ho-Yuen F.
    Lam, Hiu-Yin S.
    Chiu, Long-Fung J.
    So, Tiffany Y.
    Wong, Ka Tak
    Wong, Yiu Chung, I
    Yu, Kevin
    Yeung, Yiu-Cheong
    Chik, Thomas
    Pang, Joanna W. K.
    Wai, Abraham Ka-chung
    Kuo, Michael D.
    Lam, Tina P. W.
    Khong, Pek-Lan
    Cheung, Ngai-Tseung
    Vardhanabhuti, Varut
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)