Application of machine learning algorithms and feature selection methods for better prediction of sludge production in a real advanced biological wastewater treatment plant

被引:5
|
作者
Ekinci, Ekin [1 ]
Ozbay, Bilge [2 ]
Omurca, Sevinc Ilhan [3 ]
Sayin, Fatma Ece [2 ]
Ozbay, Ismail [2 ]
机构
[1] Sakarya Univ Appl Sci, Fac Technol, Comp Engn Dept, Sakarya, Turkiye
[2] Kocaeli Univ, Fac Engn, Environm Engn Dept, Kocaeli, Turkiye
[3] Kocaeli Univ, Fac Engn, Comp Engn Dept, Kocaeli, Turkiye
关键词
Municipal wastewater; Sludge production; Machine learning models; Prediction; Feature selection;
D O I
10.1016/j.jenvman.2023.119448
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Although the management of sewage sludge is an important and challenging task of wastewater treatment, there is a scarcity of studies on the prediction of waste sludge. To overcome this deficiency, the present work aims to develop an appropriate model providing accurate and fast prediction of sewage sludge. With this aim, different machine learning (ML) algorithms were tested by data obtained from a real advanced biological wastewater treatment plant located in Kocaeli, Turkey. In modelling studies, a data set from January 2022 to December 2022 composed of 208 daily measurements was considered. The flow rate of the plant (Q), polyelectrolyte dosage (PD) and removed amounts of total suspended solids (TSS), chemical oxygen demand (COD), biological oxygen demand (BOD), total phosphorous (TP), total nitrogen (TN) were assigned as input parameters to predict sludge production (SP). The precision of the models was evaluated in terms of Mean Square Error (MSE), Mean Absolute Percentage Error (MAPE), Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and correlation coefficient (R2). Among the various tested models Kernel Ridge Regression provided the best accuracy with R2 value of 0.94 and MAE value of 3.25. Mutual information-based feature selection (MIFS) and correlation-based feature selection (CFS) algorithms were also used in the study in order to enhance the model performance. Thus, higher prediction accuracies were achieved using the selected subset of features. Furthermore, importance contribution of features were calculated and visualized by SHapley Additive exPlanations (SHAP) technique. The overall results of the work indicate the feasibility of ML models for describing the dynamic and complex nature of SP. The process operators may benefit from this modelling approach since it enables accurate and fast estimation of sewage sludge by using fewer and easily measurable parameters.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Comparative study on total nitrogen prediction in wastewater treatment plant and effect of various feature selection methods on machine learning algorithms performance
    Bagherzadeh, Faramarz
    Mehrani, Mohamad-Javad
    Basirifard, Milad
    Roostaei, Javad
    [J]. JOURNAL OF WATER PROCESS ENGINEERING, 2021, 41
  • [2] Analysis of Machine Learning Models for Wastewater Treatment Plant Sludge Output Prediction
    Shao, Shuai
    Fu, Dianzheng
    Yang, Tianji
    Mu, Hailin
    Gao, Qiufeng
    Zhang, Yun
    [J]. SUSTAINABILITY, 2023, 15 (18)
  • [3] Application of ozonation to reduce biological sludge production in an industrial wastewater treatment plant
    Albuquerque, J. S.
    Domingos, J. C.
    Sant'Anna, G. L., Jr.
    Dezotti, M.
    [J]. WATER SCIENCE AND TECHNOLOGY, 2008, 58 (10) : 1971 - 1976
  • [4] Analyzing the impact of feature selection methods on machine learning algorithms for heart disease prediction
    Noroozi, Zeinab
    Orooji, Azam
    Erfannia, Leila
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [5] Analyzing the impact of feature selection methods on machine learning algorithms for heart disease prediction
    Zeinab Noroozi
    Azam Orooji
    Leila Erfannia
    [J]. Scientific Reports, 13
  • [6] Input variable selection using machine learning and global sensitivity methods for the control of sludge bulking in a wastewater treatment plant
    Hvala, Nadja
    Kocijan, Jus
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2021, 154
  • [7] MANAGEMENT OF WASTE SLUDGE IN PASAKOY ADVANCED BIOLOGICAL WASTEWATER TREATMENT PLANT
    Turkmenler, H.
    Aslan, M.
    [J]. JOURNAL OF ENVIRONMENTAL PROTECTION AND ECOLOGY, 2015, 16 (01): : 214 - 221
  • [8] Prediction of Wastewater Quality at a Wastewater Treatment Plant Inlet Using a System Based on Machine Learning Methods
    Wodecka, Barbara
    Drewnowski, Jakub
    Bialek, Anita
    Lazuka, Ewa
    Szulzyk-Cieplak, Joanna
    [J]. PROCESSES, 2022, 10 (01)
  • [9] Early Prediction of Diabetes Using Feature Selection and Machine Learning Algorithms
    Abdollahi J.
    Aref S.
    [J]. SN Computer Science, 5 (2)
  • [10] Wastewater Plant Reliability Prediction Using the Machine Learning Classification Algorithms
    Velimirovic, Lazar Z.
    Jankovic, Radmila
    Velimirovic, Jelena D.
    Janjic, Aleksandar
    [J]. SYMMETRY-BASEL, 2021, 13 (08):