Interpretable prediction, classification and regulation of water quality: A case study of Poyang Lake, China

被引:14
|
作者
Yao, Zhiyuan [1 ]
Wang, Zhaocai [1 ]
Huang, Jinghan [2 ]
Xu, Nannan [1 ]
Cui, Xuefei [3 ]
Wu, Tunhua [4 ]
机构
[1] Shanghai Ocean Univ, Coll Informat, Shanghai 201306, Peoples R China
[2] Shanghai Ocean Univ, Coll Econ & Management, Shanghai 201306, Peoples R China
[3] Shanghai Ocean Univ, Coll Engn, Shanghai 201306, Peoples R China
[4] Wenzhou Med Univ, Sch Informat & Engn, Wenzhou 325035, Peoples R China
关键词
Explainable Artificial Intelligence; Water quality prediction; Water quality classification; Water quality regulation; Spatiotemporal data fusion;
D O I
10.1016/j.scitotenv.2024.175407
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Effective identification and regulation of water quality impact factors is essential for water resource management and environmental protection. However, the complex coupling of water quality systems poses a significant challenge to this task. This study proposes coherent model for water quality prediction, classification and regulation based on interpretable machine learning. The decomposition-reconstruction module is used to transform non-stationary water quality series into stationary series while effectively reducing the feature dimensions. Spatiotemporal multi-source data is introduced by using the Maximum Information Coefficient (MIC) for feature selection. The Temporal Convolutional Network (TCN) is used to extract the temporal features of different variables, followed by the introduction of External Attention mechanism (EA) to construct the relationship between these features. Finally, the target water quality sequence is simulated using Gated Recurrent Unit (GRU). The proposed model was applied to Poyang Lake in China to predict six water quality indicators: ammonia nitrogen (NH3-N), dissolved oxygen (DO), pH, total nitrogen (TN), total phosphorus (TP), water temperature (WT). The water quality was then classified based on the prediction results using the XGBoost algorithm. The findings indicate that the proposed model's Nash-Sutcliff Efficiency (NSE) value ranges from 0.88 to 0.99, surpassing that of the benchmark model, and demonstrates strong interval prediction performance. The results highlight the superior performance of the XGBoost algorithm (with an accuracy of 0.89) in addressing water quality classification issues, particularly in cases of category imbalance. Subsequently, interpretability analysis using the SHapley Additive exPlanation (SHAP) method revealed that the model is capable of learning relationships between different variables and there exists a possibility of learning the physical laws. Ultimately, this study proposes a water quality regulation mechanism that improves TN and DO levels by stepwise changing the magnitude of water temperature, which significantly improves in the case of data limitations. In conclusion, this study presents an overall framework for integrating water quality prediction, classification and improvement for the first time, forming a complete set of water quality early warning and improvement management strategies. This framework provides new ideas and ways for lake water quality management.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Comparison of random forests and other statistical methods for the prediction of lake water level: a case study of the Poyang Lake in China
    Li, Bing
    Yang, Guishan
    Wan, Rongrong
    Dai, Xue
    Zhang, Yanhui
    HYDROLOGY RESEARCH, 2016, 47 : 69 - 83
  • [2] Transformer Based Water Level Prediction in Poyang Lake, China
    Xu, Jiaxing
    Fan, Hongxiang
    Luo, Minghan
    Li, Piji
    Jeong, Taeseop
    Xu, Ligang
    WATER, 2023, 15 (03)
  • [3] Correlation Analysis of Water Quality Between Lake Inflow and Outflow: A Case Study of Poyang Lake
    Huang D.-L.
    Ni Z.-K.
    Zhao S.
    Zhang B.-T.
    Feng M.-L.
    Chen H.-W.
    Li X.-X.
    Wang S.-R.
    Huanjing Kexue/Environmental Science, 2019, 40 (10): : 4450 - 4460
  • [4] Hydrodynamic and water quality modeling of a large floodplain lake (Poyang Lake) in China
    Bing Li
    Guishan Yang
    Rongrong Wan
    Hengpeng Li
    Environmental Science and Pollution Research, 2018, 25 : 35084 - 35098
  • [5] Hydrodynamic and water quality modeling of a large floodplain lake (Poyang Lake) in China
    Li, Bing
    Yang, Guishan
    Wan, Rongrong
    Li, Hengpeng
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2018, 25 (35) : 35084 - 35098
  • [6] Water age prediction and its potential impacts on water quality using a hydrodynamic model for Poyang Lake, China
    Hengda Qi
    Jianzhong Lu
    Xiaoling Chen
    Sabine Sauvage
    José-Miguel Sanchez-Pérez
    Environmental Science and Pollution Research, 2016, 23 : 13327 - 13341
  • [7] Water age prediction and its potential impacts on water quality using a hydrodynamic model for Poyang Lake, China
    Qi, Hengda
    Lu, Jianzhong
    Chen, Xiaoling
    Sauvage, Sabine
    Sanchez-Perez, Jose-Miguel
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2016, 23 (13) : 13327 - 13341
  • [8] Water quality characteristics of Poyang Lake, China, in response to changes in the water level
    Liu, Xia
    Teubner, Katrin
    Chen, Yuwei
    HYDROLOGY RESEARCH, 2016, 47 : 238 - 248
  • [9] Water quality assessment based on the water quality index method in Lake Poyang: The largest freshwater lake in China
    Zhaoshi Wu
    Dawen Zhang
    Yongjiu Cai
    Xiaolong Wang
    Lu Zhang
    Yuwei Chen
    Scientific Reports, 7
  • [10] Water quality assessment based on the water quality index method in Lake Poyang: The largest freshwater lake in China
    Wu, Zhaoshi
    Zhang, Dawen
    Cai, Yongjiu
    Wang, Xiaolong
    Zhang, Lu
    Chen, Yuwei
    SCIENTIFIC REPORTS, 2017, 7