Estimating COVID-19 cases on a university campus based on Wastewater Surveillance using machine learning regression models

被引:6
|
作者
Senaratna, Kavindra Yohan Kuhatheva [1 ]
Bhatia, Sumedha [1 ]
Giek, Goh Shin [2 ]
Lim, Chun Min Benjamin [1 ]
Gangesh, G. Reuben [1 ]
Peng, Lim Cheh [3 ]
Wong, Judith Chui Ching [4 ]
Ng, Lee Ching [4 ,5 ]
Gin, Karina Yew-Hoong [1 ,2 ]
机构
[1] Natl Univ Singapore, NUS Environm Res Inst, T Lab Bldg,5A Engn Dr 1, Singapore 117411, Singapore
[2] Natl Univ Singapore, Dept Civil & Environm Engn, Engn Dr 2, Singapore 117576, Singapore
[3] Natl Univ Singapore, Off Risk Management & Compliance, Singapore 119077, Singapore
[4] Natl Environm Agcy, Environm Hlth Inst, 11 Biopolis Way,06-05-08, Singapore 138667, Singapore
[5] Nanyang Technol Univ, Sch Biol Sci, 60 Nanyang Dr, Singapore 637551, Singapore
关键词
Wastewater surveillance; COVID-19; University; Machine learning regression; Multivariate model;
D O I
10.1016/j.scitotenv.2023.167709
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Wastewater Surveillance (WS) is a crucial tool in the management of COVID-19 pandemic. The surveillance is based on enumerating SARS-CoV-2 RNA concentrations in the community's sewage. In this study, we used WS data to develop a regression model for estimating the number of active COVID-19 cases on a university campus. Eight univariate and multivariate regression model types i.e. Linear Regression (LM), Polynomial Regression (PR), Generalised Additive Model (GAM), Locally Estimated Scatterplot Smoothing Regression (LOESS), K Nearest Neighbours Regression (KNN), Support Vector Regression (SVR), Artificial Neural Networks (ANN) and Random Forest (RF) were developed and compared. We found that the multivariate RF regression model, was the most appropriate for predicting the prevalence of COVID-19 infections at both a campus level and hostel-level. We also found that smoothing the normalised SARS-CoV-2 data and employing multivariate modelling, using student population as a second independent variable, significantly improved the performance of the models. The final RF campus level model showed good accuracy when tested using previously unseen data; correlation coefficient of 0.97 and a mean absolute error (MAE) of 20 %. In summary, our non-intrusive approach has the ability to complement projections based on clinical tests, facilitating timely follow-up and response.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Wastewater surveillance of SARS-CoV-2 in dormitories as a part of comprehensive university campus COVID-19 monitoring
    Lu, Emily
    Ai, Yuehan
    Davis, Angela
    Straathof, Judith
    Halloran, Kent
    Hull, Natalie
    Winston, Ryan
    Weir, Mark H.
    Soller, Jeffrey
    Bohrerova, Zuzana
    Oglesbee, Michael
    Lee, Jiyoung
    ENVIRONMENTAL RESEARCH, 2022, 212
  • [22] Application of machine learning for multi-community COVID-19 outbreak predictions with wastewater surveillance
    Ai, Yuehan
    He, Fan
    Lancaster, Emma
    Lee, Jiyoung
    PLOS ONE, 2022, 17 (11):
  • [23] Rapid, Large-Scale Wastewater Surveillance and Automated Reporting System Enable Early Detection of Nearly 85% of COVID-19 Cases on a University Campus
    Karthikeyan, Smruthi
    Nguyen, Andrew
    McDonald, Daniel
    Zong, Yijian
    Ronquillo, Nancy
    Ren, Junting
    Zou, Jingjing
    Farmer, Sawyer
    Humphrey, Greg
    Henderson, Diana
    Javidi, Tara
    Messer, Karen
    Anderson, Cheryl
    Schooley, Robert
    Martin, Natasha K.
    Knight, Rob
    MSYSTEMS, 2021, 6 (04)
  • [24] COVID-19 Surveillance Wastewater-based epidemiology
    Mullasseri, Sileesh
    CURRENT SCIENCE, 2021, 120 (11): : 1660 - 1660
  • [25] Prediction of COVID-19 Active Cases Using Polynomial Regression and ARIMA Models
    Neily, Neji
    Ben Ammar, Boulbaba
    Kammoun, Habib M.
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 1351 - 1362
  • [26] Modified Machine Learning Techique for Curve Fitting on Regression Models for COVID-19 projections
    Andreas, Andreou
    Mavromoustalcis, Constandinos X.
    Mastorakis, George
    Mumtaz, Shahid
    Batalla, Jordi Mongay
    Pallis, Evangelos
    2020 IEEE 25TH INTERNATIONAL WORKSHOP ON COMPUTER AIDED MODELING AND DESIGN OF COMMUNICATION LINKS AND NETWORKS (CAMAD), 2020,
  • [27] Analysis of COVID-19 pandemic and forecasting using machine learning models
    Chauhan, Ekansh
    Sirswal, Manpreet
    Gupta, Deepak
    Khanna, Ashish
    Khamparia, Aditya
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2021, 66 (3-4) : 309 - 333
  • [28] COVID-19 surveillance in Southeastern Virginia using wastewater-based epidemiology
    Gonzalez, Raul
    Curtis, Kyle
    Bivins, Aaron
    Bibby, Kyle
    Weir, Mark H.
    Yetka, Kathleen
    Thompson, Hannah
    Keeling, David
    Mitchell, Jamie
    Gonzalez, Dana
    WATER RESEARCH, 2020, 186
  • [29] Analysis and Prediction of COVID-19 Data using Machine Learning Models
    Chrin, Richvichanak
    Wang, Sujing
    ACM International Conference Proceeding Series, 2021, : 296 - 301
  • [30] PREDICTING THE FUTURE COVID-19 USING SUPERVISED MACHINE LEARNING MODELS
    Shake, Saida
    Lavanya, Ch.
    Lakshmi, S. Venkata Maha
    Bhavani, B.
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (04) : 1375 - 1378