Assessing and predicting water quality index with key water parameters by machine learning models in coastal cities, China

被引:5
|
作者
Xu, Jing [1 ]
Mo, Yuming [2 ]
Zhu, Senlin [1 ]
Wu, Jinran [3 ]
Jin, Guangqiu [4 ]
Wang, You-Gan [5 ]
Ji, Qingfeng [1 ]
Li, Ling [6 ]
机构
[1] Yangzhou Univ, Coll Hydraul Sci & Engn, Yangzhou, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Naval Architecture & Ocean Engn, Zhenjiang, Peoples R China
[3] Australian Catholic Univ, Inst Posit Psychol & Educ, North Sydney, Australia
[4] Hohai Univ, Natl Key Lab Water Disaster Prevent, Nanjing, Peoples R China
[5] Univ Queensland, Sch Math & Phys, Brisbane, Qld, Australia
[6] Westlake Univ, Sch Engn, Key Lab Coastal Environm & Resources Zhejiang Prov, Hangzhou, Peoples R China
关键词
Water quality; Key water parameters; Water quality index (WQI); Machine learning models; Coastal cities; YELLOW SEA; MICROPLASTIC POLLUTION; STATISTICAL-ANALYSIS; SUSPENDED SEDIMENT; SPATIAL VARIATIONS; DISSOLVED-OXYGEN; RIVER-BASIN; FRESH-WATER; CLASSIFICATION; PERFORMANCE;
D O I
10.1016/j.heliyon.2024.e33695
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The water quality index (WQI) is a widely used tool for comprehensive assessment of river environments. However, its calculation involves numerous water quality parameters, making sample collection and laboratory analysis time-consuming and costly. This study aimed to identify key water parameters and the most reliable prediction models that could provide maximum accuracy using minimal indicators. Water quality from 2020 to 2023 were collected including nine biophysical and chemical indicators in seventeen rivers in Yancheng and Nantong, two coastal cities in Jiangsu Province, China, adjacent to the Yellow Sea. Linear regression and seven machine learning models (Artificial Neural Network (ANN), Self-Organizing Maps (SOM), K-Nearest Neighbor (KNN), Support Vector Machines (SVM), Random Forest (RF), Extreme Gradient Boosting (XGB) and Stochastic Gradient Boosting (SGB)) were developed to predict WQI using different groups of input variables based on correlation analysis. The results indicated that water quality improved from 2020 to 2022 but deteriorated in 2023, with inland stations exhibiting better conditions than coastal ones, particularly in terms of turbidity and nutrients. The water environment was comparatively better in Nantong than in Yancheng, with mean WQI values of approximately 55.3-72.0 and 56.4-67.3, respectively. The classifications "Good" and "Medium" accounted for 80 % of the records, with no instances of "Excellent" and 2 % classified as "Bad". The performance of all prediction models, except for SOM, improved with the addition of input variables, achieving R2 values higher than 0.99 in models such as SVM, RF, XGB, and SGB. The most reliable models were RF and XGB with key parameters of total phosphorus (TP), ammonia nitrogen (AN), and dissolved oxygen (DO) (R2 = 0.98 and 0.91 for training and testing phase) for predicting WQI values, and RF using TP and AN (accuracy higher than 85 %) for WQI grades. The prediction accuracy for "Medium" and "Low" water quality grades was highest at 90 %, followed by the "Good" level at 70 %. The model results could contribute to efficient water quality evaluation by identifying key water parameters and facilitating effective water quality management in river basins.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data
    Chen, Kangyang
    Chen, Hexia
    Zhou, Chuanlong
    Huang, Yichao
    Qi, Xiangyang
    Shen, Ruqin
    Liu, Fengrui
    Zuo, Min
    Zou, Xinyi
    Wang, Jinfeng
    Zhang, Yan
    Chen, Da
    Chen, Xingguo
    Deng, Yongfeng
    Ren, Hongqiang
    WATER RESEARCH, 2020, 171 (171)
  • [42] Developing a novel tool for assessing the groundwater incorporating water quality index and machine learning approach
    Sajib, Abdul Majed
    Diganta, Mir Talas Mahammad
    Rahman, Azizur
    Dabrowski, Tomasz
    Olbert, Agnieszka I.
    Uddin, Md Galal
    GROUNDWATER FOR SUSTAINABLE DEVELOPMENT, 2023, 23
  • [43] An Analysis of the Spatiotemporal Variability of Key Water Quality Parameters in China
    Li, Kexin
    Yang, Qichun
    Li, Xia
    HYDROLOGY, 2024, 11 (09)
  • [44] An advanced deep learning model for predicting water quality index
    Ehteram, Mohammad
    Ahmed, Ali Najah
    Sherif, Mohsen
    El-Shafie, Ahmed
    ECOLOGICAL INDICATORS, 2024, 160
  • [45] Assessing skill of estuarine and coastal eutrophication models for water quality managers
    Fitzpatrick, James J.
    JOURNAL OF MARINE SYSTEMS, 2009, 76 (1-2) : 195 - 211
  • [46] Predicting Aquaculture Water Quality Using Machine Learning Approaches
    Li, Tingting
    Lu, Jian
    Wu, Jun
    Zhang, Zhenhua
    Chen, Liwei
    WATER, 2022, 14 (18)
  • [47] Application of machine learning methods on predicting irrigation water quality
    Lin Y.P.
    Lien W.Y.
    Chen H.Y.
    He J.H.
    Chou C.F.
    Taiwan Water Conservancy, 2020, 68 (01): : 1 - 14
  • [48] An Enhanced Water Quality Index for Water Quality Monitoring Using Remote Sensing and Machine Learning
    Ahmed, Mehreen
    Mumtaz, Rafia
    Anwar, Zahid
    APPLIED SCIENCES-BASEL, 2022, 12 (24):
  • [49] Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index
    Sani Isah Abba
    Quoc Bao Pham
    Gaurav Saini
    Nguyen Thi Thuy Linh
    Ali Najah Ahmed
    Meriame Mohajane
    Mohammadreza Khaledian
    Rabiu Aliyu Abdulkadir
    Quang-Vu Bach
    Environmental Science and Pollution Research, 2020, 27 : 41524 - 41539
  • [50] Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index
    Abba, Sani Isah
    Pham, Quoc Bao
    Saini, Gaurav
    Linh, Nguyen Thi Thuy
    Ahmed, Ali Najah
    Mohajane, Meriame
    Khaledian, Mohammadreza
    Abdulkadir, Rabiu Aliyu
    Bach, Quang-Vu
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2020, 27 (33) : 41524 - 41539