Proposed formulation of surface water quality and modelling using gene expression, machine learning, and regression techniques

Cited by: 74
Authors
Shah, Muhammad Izhar [1]
Javed, Muhammad Faisal [1]
Abunama, Taher [2]
Affiliations
[1] COMSATS Univ Islamabad, Dept Civil Engn, Abbottabad Campus, Abbottabad 22060, Pakistan
[2] Durban Univ Technol, Inst Water & Wastewater Technol, POB 1334, Durban, South Africa
Keywords
Surface water quality; Machine learning algorithms; Regression; Sensitivity and parametric analyses; k-fold cross-validation; ARTIFICIAL NEURAL-NETWORKS; UPPER INDUS BASIN; UNCERTAINTY ANALYSIS; PARAMETERS; GROUNDWATER; PREDICTION; ANN;
DOI
10.1007/s11356-020-11490-9
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
Rising water pollution from anthropogenic factors motivates further research into water quality prediction models. The available models are limited by short data records and by their inability to provide empirical expressions. This study models surface water quality in the upper Indus River basin and derives empirical equations for it using a 30-year dataset and machine learning techniques, and then determines the most reliable model for accurately predicting river water quality. Total dissolved solids (TDS) and electrical conductivity (EC) were used as dependent variables, whereas eight parameters were used as independent variables, with 70% and 30% of the data used for model training and testing, respectively. Several evaluation criteria, i.e., Nash-Sutcliffe efficiency (NSE), root mean square error (RMSE), coefficient of determination (R²), and mean absolute error (MAE), were used to assess model performance. The data were also validated with k-fold cross-validation using R² and RMSE. The results indicated a strong correlation, with NSE and R² both above 0.85 for all the developed models. Gene expression programming (GEP) outperformed the artificial neural network (ANN) and the linear and non-linear regression models for both TDS and EC. The sensitivity and parametric analyses revealed that bicarbonate is the most sensitive parameter influencing both the TDS and EC models. Two equations were derived and formulated to represent the novel results of the GEP model and to help authorities monitor river water quality effectively.
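The four evaluation criteria and the k-fold split named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the function names are hypothetical, R² is assumed here to be the squared Pearson correlation between observed and predicted values (the abstract does not state which definition was used), and the fold generator uses contiguous, unshuffled folds for simplicity.

```python
import math

def rmse(obs, pred):
    """Root mean square error between observed and predicted values."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))

def mae(obs, pred):
    """Mean absolute error between observed and predicted values."""
    return sum(abs(o - p) for o, p in zip(obs, pred)) / len(obs)

def nse(obs, pred):
    """Nash-Sutcliffe efficiency: 1 minus the ratio of the squared
    residuals to the squared deviations of observations from their mean."""
    mean_obs = sum(obs) / len(obs)
    residual = sum((o - p) ** 2 for o, p in zip(obs, pred))
    spread = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - residual / spread

def r_squared(obs, pred):
    """Assumed definition: squared Pearson correlation of obs and pred."""
    mean_obs = sum(obs) / len(obs)
    mean_pred = sum(pred) / len(pred)
    cov = sum((o - mean_obs) * (p - mean_pred) for o, p in zip(obs, pred))
    var_obs = sum((o - mean_obs) ** 2 for o in obs)
    var_pred = sum((p - mean_pred) ** 2 for p in pred)
    return cov ** 2 / (var_obs * var_pred)

def kfold_indices(n, k):
    """Yield (train, test) index lists for k contiguous folds over n samples.
    The first n % k folds get one extra sample each."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n) if i < start or i >= start + size]
        yield train, test
        start += size

if __name__ == "__main__":
    # Made-up values for illustration only; not data from the study.
    observed = [310.0, 295.0, 402.0, 358.0, 330.0]
    predicted = [305.0, 300.0, 390.0, 365.0, 328.0]
    print("NSE :", nse(observed, predicted))
    print("RMSE:", rmse(observed, predicted))
    print("R^2 :", r_squared(observed, predicted))
    print("MAE :", mae(observed, predicted))
    for train_idx, test_idx in kfold_indices(len(observed), 5):
        print("train", train_idx, "test", test_idx)
```

NSE and the assumed R² can differ for the same predictions: a model with a constant bias keeps a perfect correlation but loses NSE, which is one reason to report both alongside RMSE and MAE.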
Pages: 13202-13220
Page count: 19
Related papers
50 in total
  • [1] Proposed formulation of surface water quality and modelling using gene expression, machine learning, and regression techniques
    Muhammad Izhar Shah
    Muhammad Faisal Javed
    Taher Abunama
    [J]. Environmental Science and Pollution Research, 2021, 28 : 13202 - 13220
  • [2] ARTIFICIAL NEURAL NETWORK AND REGRESSION TECHNIQUES IN MODELLING SURFACE WATER QUALITY
    Merdun, Hasan
    Cinar, Oezer
    [J]. ENVIRONMENT PROTECTION ENGINEERING, 2010, 36 (02): : 95 - 109
  • [3] Machine learning techniques in river water quality modelling: a research travelogue
    Khullar, Sakshi
    Singh, Nanhey
    [J]. WATER SUPPLY, 2021, 21 (01) : 1 - 13
  • [4] Temporal Dynamics and Predictive Modelling of Streamflow and Water Quality Using Advanced Statistical and Ensemble Machine Learning Techniques
    Farzana, Syeda Zehan
    Paudyal, Dev Raj
    Chadalavada, Sreeni
    Alam, Md Jahangir
    [J]. WATER, 2024, 16 (15)
  • [5] Assessment of surface water quality in the Sebou watershed (Morocco) using a nonparametric approach and machine learning techniques
    Khalid Chadli
    [J]. Arabian Journal of Geosciences, 2023, 16 (9)
  • [6] Effective monitoring of Noyyal River surface water quality using remote sensing and machine learning and GIS techniques
    Adilakshmi, A.
    Venkatesan, V.
    [J]. DESALINATION AND WATER TREATMENT, 2024, 320
  • [7] Ground Water Quality Analysis using Machine Learning Techniques: a Critical Appraisal
    Chandel, Naman
    Gupta, Sushindra Kumar
    Ravi, Anand Kumar
    [J]. JOURNAL OF MINING AND ENVIRONMENT, 2024, 15 (02): : 419 - 426
  • [8] Water quality assurance in aquaculture ponds using Machine Learning and IoT techniques
    Quintero, Ricardo
    Parra, Jaqueline
    Felix, Francisco
    [J]. 2022 IEEE MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC), 2022,
  • [9] Evaporation modelling using different machine learning techniques
    Wang, Lunche
    Kisi, Ozgur
    Hu, Bo
    Bilal, Muhammad
    Zounemat-Kermani, Mohammad
    Li, Hui
    [J]. INTERNATIONAL JOURNAL OF CLIMATOLOGY, 2017, 37 : 1076 - 1092
  • [10] Forecasting Dengue Fever Using Machine Learning Regression Techniques
    Baker, Qanita Bani
    Faraj, Dalya
    Alguzo, Alanoud
    [J]. 2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 157 - 163