Evaluation of the bias and precision of regression techniques and machine learning approaches in total dissolved solids modeling of an urban aquifer

被引:0
|
作者
Conglian Pan
Kelvin Tsun Wai Ng
Bahareh Fallah
Amy Richter
机构
[1] University of Regina,Environmental Systems Engineering
关键词
Total dissolved solids; Artificial neural network; Principal component regression; Multivariate statistical analysis; Machine learning methods; Bias and precision;
D O I
暂无
中图分类号
学科分类号
摘要
TDS is modeled for an aquifer near an unlined landfill in Canada. Canadian Drinking Water Guidelines and other indices are used to evaluate TDS concentrations in 27 monitoring wells surrounding the landfill. This study aims to predict TDS concentrations using three different modeling approaches: dual-step multiple linear regression (MLR), hybrid principal component regression (PCR), and backpropagation neural networks (BPNN). An analysis of the bias and precision of each models follows, using performance evaluation metrics and statistical indices. TDS is one of the most important parameters in assessing suitability of water for irrigation, and for overall groundwater quality assessment. Good agreement was observed between the MLR1 model and field data, although multicollinearity issues exist. Percentage errors of hybrid PCR were comparable to the dual-step MLR method. Percentage error for hybrid PCR was found to be inversely proportional to TDS concentrations, which was not observed for dual-step MLR. Larger errors were obtained from the BPNN models, and higher percentage errors were observed in monitoring wells with lower TDS concentrations. All models in this study adequately describe the data in testing stage (R2 > 0.86). Generally, the dual-step MLR and hybrid PCR models fared better (R2avg = 0.981 and 0.974, respectively), while BPNN models performed worse (R2avg = 0.904). For this dataset, both regression and machine learning models are more suited to predict mid-range data compared to extreme values. Advanced regression methods (hybrid PCR and dual-step MLR) are more advantageous compared to BPNN.
引用
收藏
页码:1821 / 1833
页数:12
相关论文
共 50 条
  • [1] Evaluation of the bias and precision of regression techniques and machine learning approaches in total dissolved solids modeling of an urban aquifer
    Pan, Conglian
    Ng, Kelvin Tsun Wai
    Fallah, Bahareh
    Richter, Amy
    [J]. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2019, 26 (02) : 1821 - 1833
  • [2] Modelling of total dissolved solids in water supply systems using regression and supervised machine learning approaches
    Anthony Ewusi
    Isaac Ahenkorah
    Derrick Aikins
    [J]. Applied Water Science, 2021, 11
  • [3] Modelling of total dissolved solids in water supply systems using regression and supervised machine learning approaches
    Ewusi, Anthony
    Ahenkorah, Isaac
    Aikins, Derrick
    [J]. APPLIED WATER SCIENCE, 2021, 11 (02)
  • [4] Optimization of State of the Art Fuzzy-Based Machine Learning Techniques for Total Dissolved Solids Prediction
    Hijji, Mohammad
    Chen, Tzu-Chia
    Ayaz, Muhammad
    Abosinnee, Ali S.
    Muda, Iskandar
    Razoumny, Yury
    Hatamiafkoueieh, Javad
    [J]. SUSTAINABILITY, 2023, 15 (08)
  • [5] Bias evaluation and minimization for estuarine total dissolved solids (TDS) patterns constructed using spatial interpolation techniques
    Ndou, Naledzani
    Nontongana, Nolonwabo
    [J]. Marine Pollution Bulletin, 2025, 210
  • [6] Supervised Machine Learning for Estimation of Total Suspended Solids in Urban Watersheds
    Moeini, Mohammadreza
    Shojaeizadeh, Ali
    Geza, Mengistu
    [J]. WATER, 2021, 13 (02)
  • [7] Machine Learning Regression Techniques for the Modeling of Complex Systems: An Overview
    Trinchero, Riccardo
    Canavero, Flavio
    [J]. IEEE Electromagnetic Compatibility Magazine, 2021, 10 (04) : 71 - 79
  • [8] Estimation of total dissolved solids (TDS) using new hybrid machine learning models
    Banadkooki, Fatemeh Barzegari
    Ehteram, Mohammad
    Panahi, Fatemeh
    Sammen, Saad Sh
    Othman, Faridah Binti
    EL-Shafie, Ahmed
    [J]. JOURNAL OF HYDROLOGY, 2020, 587
  • [9] Hybrid Machine Learning Ensemble Techniques for Modeling Dissolved Oxygen Concentration
    Abba, Sani Isah
    Linh, Nguyen Thi Thuy
    Abdullahi, Jazuli
    Ali, Shaban Ismael Albrka
    Pham, Quoc Bao
    Abdulkadir, Rabiu Aliyu
    Costache, Romulus
    Nam, Van Thai
    Anh, Duong Tran
    [J]. IEEE ACCESS, 2020, 8 : 157218 - 157237
  • [10] Evaluation of total dissolved solids in rivers by improved neuro fuzzy approaches using metaheuristic algorithms
    Mahdieh Jannatkhah
    Rouhollah Davarpanah
    Bahman Fakouri
    Ozgur Kisi
    [J]. Earth Science Informatics, 2024, 17 : 1501 - 1522