Virtual sample generation empowers machine learning-based effluent prediction in constructed wetlands

被引:8
|
作者
Dong, Qiyu [1 ]
Bai, Shunwen [1 ]
Wang, Zhen [1 ]
Zhao, Xinyue [2 ]
Yang, Shanshan [1 ]
Ren, Nanqi [1 ]
机构
[1] Harbin Inst Technol, Sch Environm, State Key Lab Urban Water Resource & Environm, Harbin 150090, Peoples R China
[2] Northeast Agr Univ, Coll Resource & Environm, Harbin 150030, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Constructed wetland design; Effluent quality prediction; Machine learning; Virtual sample generation; ARTIFICIAL-INTELLIGENCE; ORGANIC-MATTER; PERFORMANCE; FLOW; NITROGEN; REMOVAL; MEDIA;
D O I
10.1016/j.jenvman.2023.118961
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The design of constructed wetlands (CWs) is critical to ensure effective wastewater treatment. However, limited availability of reliable data can hamper the accuracy of CW effluent predictions, thus increasing design costs and time. In this study, a novel effluent prediction framework for CWs is proposed, utilizing data dimensionality reduction and virtual sample generation. By using four the machine learning algorithms (Cubist, random forest, support vector regression, and extreme learning machine), important features of CW design are identified and used to build prediction models. The extreme learning machine algorithm achieved the highest determination coefficient and lowest error, identifying it as the most suitable algorithm for effluent prediction. A multidistribution mega-trend-diffusion algorithm with particle swarm optimization was employed to generate virtual samples. These virtual samples were then combined with real samples to retrain the prediction model and verify the optimization effect. Comparative analysis demonstrated that the integration of virtual samples significantly improved the prediction accuracy for ammonium and chemical oxygen demand. The root mean square error decreased by averages of 60.5% and 42.1%, respectively, and the mean absolute percentage error by averages of 21.5% and 23.8%, respectively. Finally, a CW design process is proposed based on prediction models and virtual samples. This integrated forward prediction and reverse design tool can efficiently support CW design when sample sizes are limited, ultimately leading to more accurate and cost-effective design solutions.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Machine Learning-Based Virtual Screening and Identification of the Fourth-Generation EGFR Inhibitors
    Chang, Hao
    Zhang, Zeyu
    Tian, Jiaxin
    Bai, Tian
    Xiao, Zijie
    Wang, Dianpeng
    Qiao, Renzhong
    Li, Chao
    ACS OMEGA, 2024, 9 (02): : 2314 - 2324
  • [12] Development of a machine learning-based acuity score prediction model for virtual care settings
    Hall, Justin N.
    Galaev, Ron
    Gavrilov, Marina
    Mondoux, Shawn
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
  • [13] Development of a machine learning-based acuity score prediction model for virtual care settings
    Justin N. Hall
    Ron Galaev
    Marina Gavrilov
    Shawn Mondoux
    BMC Medical Informatics and Decision Making, 23
  • [14] Machine Learning-Based Fifth-Generation Network Traffic Prediction Using Federated Learning
    Harir, Mohamed Abdelkarim Nimir
    Ataro, Edwin
    Nyah, Clement Temaneh
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (01) : 304 - 313
  • [15] Improving the case-based reasoning prediction of the compliance of treated effluent from constructed wetlands
    Crapper, Martin
    Ellis, Ruth
    Furber, Alison
    CIVIL ENGINEERING AND ENVIRONMENTAL SYSTEMS, 2010, 27 (02) : 123 - 132
  • [16] Prediction of Chloride Diffusion Coefficient in Concrete Based on Machine Learning and Virtual Sample Algorithm
    Zhou, Fei-Yu
    Tao, Ning-Jing
    Zhang, Yu-Rong
    Yuan, Wei-Bin
    SUSTAINABILITY, 2023, 15 (24)
  • [17] Machine Learning-based BGP Traffic Prediction
    Farasat, Talaya
    Rathore, Muhammad Ahmad
    Khan, Akmal
    Kim, JongWon
    Posegga, Joachim
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 1925 - 1934
  • [18] Method and application of sand body thickness prediction based on virtual sample machine learning
    Zhen, Yan
    Zhao, Zhen
    Zhao, Xiaoming
    Ge, Jiawang
    Zhang, An
    Yang, Changcheng
    GEOPHYSICS, 2024, 89 (06) : M169 - M184
  • [19] Machine learning-based prediction models in neurosurgery
    Habashy, Karl J.
    Arrieta, Victor A.
    Feghali, James
    NEUROSURGICAL FOCUS, 2023, 55 (03)
  • [20] Machine Learning-based Prediction of Test Power
    Dhotre, Harshad
    Eggersgluess, Stephan
    Chakrabarty, Krishnendu
    Drechsler, Rolf
    2019 IEEE EUROPEAN TEST SYMPOSIUM (ETS), 2019,