Estimating COVID-19 cases on a university campus based on Wastewater Surveillance using machine learning regression models

被引:6
|
作者
Senaratna, Kavindra Yohan Kuhatheva [1 ]
Bhatia, Sumedha [1 ]
Giek, Goh Shin [2 ]
Lim, Chun Min Benjamin [1 ]
Gangesh, G. Reuben [1 ]
Peng, Lim Cheh [3 ]
Wong, Judith Chui Ching [4 ]
Ng, Lee Ching [4 ,5 ]
Gin, Karina Yew-Hoong [1 ,2 ]
机构
[1] Natl Univ Singapore, NUS Environm Res Inst, T Lab Bldg,5A Engn Dr 1, Singapore 117411, Singapore
[2] Natl Univ Singapore, Dept Civil & Environm Engn, Engn Dr 2, Singapore 117576, Singapore
[3] Natl Univ Singapore, Off Risk Management & Compliance, Singapore 119077, Singapore
[4] Natl Environm Agcy, Environm Hlth Inst, 11 Biopolis Way,06-05-08, Singapore 138667, Singapore
[5] Nanyang Technol Univ, Sch Biol Sci, 60 Nanyang Dr, Singapore 637551, Singapore
关键词
Wastewater surveillance; COVID-19; University; Machine learning regression; Multivariate model;
D O I
10.1016/j.scitotenv.2023.167709
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Wastewater Surveillance (WS) is a crucial tool in the management of COVID-19 pandemic. The surveillance is based on enumerating SARS-CoV-2 RNA concentrations in the community's sewage. In this study, we used WS data to develop a regression model for estimating the number of active COVID-19 cases on a university campus. Eight univariate and multivariate regression model types i.e. Linear Regression (LM), Polynomial Regression (PR), Generalised Additive Model (GAM), Locally Estimated Scatterplot Smoothing Regression (LOESS), K Nearest Neighbours Regression (KNN), Support Vector Regression (SVR), Artificial Neural Networks (ANN) and Random Forest (RF) were developed and compared. We found that the multivariate RF regression model, was the most appropriate for predicting the prevalence of COVID-19 infections at both a campus level and hostel-level. We also found that smoothing the normalised SARS-CoV-2 data and employing multivariate modelling, using student population as a second independent variable, significantly improved the performance of the models. The final RF campus level model showed good accuracy when tested using previously unseen data; correlation coefficient of 0.97 and a mean absolute error (MAE) of 20 %. In summary, our non-intrusive approach has the ability to complement projections based on clinical tests, facilitating timely follow-up and response.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] COVID-19 Future Forecasting Using Supervised Machine Learning Models
    Rustam, Furqan
    Reshi, Aijaz Ahmad
    Mehmood, Arif
    Ullah, Saleem
    On, Byung-Won
    Aslam, Waqar
    Choi, Gyu Sang
    IEEE ACCESS, 2020, 8 (08): : 101489 - 101499
  • [32] The COVID-19 pandemic: prediction study based on machine learning models
    Zohair Malki
    El-Sayed Atlam
    Ashraf Ewis
    Guesh Dagnew
    Osama A. Ghoneim
    Abdallah A. Mohamed
    Mohamed M. Abdel-Daim
    Ibrahim Gad
    Environmental Science and Pollution Research, 2021, 28 : 40496 - 40506
  • [33] The COVID-19 pandemic: prediction study based on machine learning models
    Malki, Zohair
    Atlam, El-Sayed
    Ewis, Ashraf
    Dagnew, Guesh
    Ghoneim, Osama A.
    Mohamed, Abdallah A.
    Abdel-Daim, Mohamed M.
    Gad, Ibrahim
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2021, 28 (30) : 40496 - 40506
  • [34] Survey of COVID-19 isolation cases at a major university campus in the United States
    Brickhouse, Mark
    Gassen, Jeffrey
    Ryan, Benjamin J.
    Muehlenbein, Michael P.
    JOURNAL OF AMERICAN COLLEGE HEALTH, 2024, 72 (09) : 3210 - 3215
  • [35] Using different machine learning models to classify patients into mild and severe cases of COVID-19 based on multivariate blood testing
    Zhang, Rui-kun
    Xiao, Qi
    Zhu, Sheng-lang
    Lin, Hai-yan
    Tang, Ming
    JOURNAL OF MEDICAL VIROLOGY, 2022, 94 (01) : 357 - 365
  • [36] Predicting University Campus Parking Demand Using Machine Learning Models
    Paudel, Sohil
    Vechione, Matthew
    Gurbuz, Okan
    TRANSPORTATION RESEARCH RECORD, 2024, 2678 (06) : 14 - 26
  • [37] Wastewater-based surveillance models for COVID-19: A focused review on spatio-temporal models
    Torabi, Fatemeh
    Li, Guangquan
    Mole, Callum
    Nicholson, George
    Rowlingson, Barry
    Smith, Camila Rangel
    Jersakova, Radka
    Diggle, Peter J.
    Blangiardo, Marta
    HELIYON, 2023, 9 (11)
  • [38] Integrated Surveillance System for Controlling COVID-19 on a University Campus, 2020-2021
    Meredith, Genevive R.
    Osman, Marwan
    Cazer, Casey L.
    Cummings, Kevin J.
    Hecht, Jason
    Madsen, Cecelia G.
    Santacrose, Laura B.
    Dubovi, Abigail S.
    Clarkberg, Marin
    Johnson, Tyler
    Fitzpatrick, Maria D.
    Parrilla, Lara
    Li, Yihong
    Francis, Lorraine
    Weisfuse, Isaac B.
    Travis, AlexanderJ
    Jones, Anne C.
    Warnick, Lorin D.
    Koretzky, Gary A.
    AMERICAN JOURNAL OF PUBLIC HEALTH, 2022, 112 (07) : 980 - 984
  • [39] Predicting reduction of COVID-19 cases in India Using Machine Learning Algorithm
    Welekar, Rashmi
    Tapase, Sharvari
    Bajaj, Shubhi
    Pande, Isha
    Verma, Abhishek
    Katpatal, Ashutosh
    Mishra, Vaibhav
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 189 - 192
  • [40] The Number of Confirmed Cases of Covid-19 by using Machine Learning: Methods and Challenges
    Ahmad, Amir
    Garhwal, Sunita
    Ray, Santosh Kumar
    Kumar, Gagan
    Malebary, Sharaf Jameel
    Barukab, Omar Mohammed
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2021, 28 (04) : 2645 - 2653