Stream water quality prediction using boosted regression tree and random forest models

被引:0
|
作者
Ali O. Alnahit
Ashok K. Mishra
Abdul A. Khan
机构
[1] Clemson University,Glenn Department of Civil Engineering
[2] King Saud University,Department of Civil Engineering
关键词
Water quality; Machine learning algorithms; Random forests; Boosted regression trees;
D O I
暂无
中图分类号
学科分类号
摘要
Reliable water quality prediction can improve environmental flow monitoring and the sustainability of the stream ecosystem. In this study, we compared two machine learning methods to predict water quality parameters, such as total nitrogen (TN), total phosphorus (TP), and turbidity (TUR), for 97 watersheds located in the Southeast Atlantic region of the USA. The modeling framework incorporates multiple climate and watershed variables (characteristics) that often control the water quality indicators in different landscapes. Three techniques, such as stepwise regression (SR), Least Absolute Shrinkage and Selection Operator (LASSO), and genetic algorithm (GA), are implemented to identify appropriate predictors out of 28 climate and catchment-related variables. The selected predictors were then used to develop the Random Forest (RF) and Boosted regression tree (BRT) models for water quality predictions in selected watersheds. The results highlighted that while both algorithms provided reasonable results (based on statistical metrics), the RF algorithm was easier to train and robust to model overfitting. Partial dependence plots highlighted the complex and nonlinear relationships between the individual predictors and the water quality indicators. The thresholds obtained from partial dependence plots showed that the median values of total nitrogen (TN) and total phosphorus (TP) in streams increase significantly when the percentage of urban and agricultural lands is above 40% and 43% of the watershed area, respectively. Furthermore, when soil hydraulic conductivity increases, the reduction in runoff results in decreased Turbidity levels in streams. Therefore, identifying the key watershed characteristics and their critical thresholds can help watershed managers create appropriate regulations for managing and sustaining healthy stream ecosystems. Besides, the forecasting models can improve water quality predictions in ungauged watersheds.
引用
收藏
页码:2661 / 2680
页数:19
相关论文
共 50 条
  • [31] Multivariate prediction of nitrogen concentration in a stream using regression models
    Andrea C. Aguilar
    Alexandra Cerón-Vivas
    Miguel Altuve
    [J]. Environmental Earth Sciences, 2021, 80
  • [32] Multivariate prediction of nitrogen concentration in a stream using regression models
    Aguilar, Andrea C.
    Ceron-Vivas, Alexandra
    Altuve, Miguel
    [J]. ENVIRONMENTAL EARTH SCIENCES, 2021, 80 (09)
  • [33] Daily Evapotranspiration Mapping Using Regression Random Forest Models
    Gonzalo-Martin, Consuelo
    Lillo-Saavedra, Mario
    Garcia-Pedrero, Angel
    Lagos, Octavio
    Menasalvas, Ernestina
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (12) : 5359 - 5368
  • [34] Prediction of meteorological drought and standardized precipitation index based on the random forest (RF), random tree (RT), and Gaussian process regression (GPR) models
    Elbeltagi, Ahmed
    Pande, Chaitanya B. B.
    Kumar, Manish
    Tolche, Abebe Debele
    Singh, Sudhir Kumar
    Kumar, Akshay
    Vishwakarma, Dinesh Kumar
    [J]. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2023, 30 (15) : 43183 - 43202
  • [35] Prediction of meteorological drought and standardized precipitation index based on the random forest (RF), random tree (RT), and Gaussian process regression (GPR) models
    Ahmed Elbeltagi
    Chaitanya B. Pande
    Manish Kumar
    Abebe Debele Tolche
    Sudhir Kumar Singh
    Akshay Kumar
    Dinesh Kumar Vishwakarma
    [J]. Environmental Science and Pollution Research, 2023, 30 : 43183 - 43202
  • [37] COVID-19 Patient Health Prediction Using Boosted Random Forest Algorithm
    Iwendi, Celestine
    Bashir, Ali Kashif
    Peshkar, Atharva
    Sujatha, R.
    Chatterjee, Jyotir Moy
    Pasupuleti, Swetha
    Mishra, Rishita
    Pillai, Sofia
    Jo, Ohyun
    [J]. FRONTIERS IN PUBLIC HEALTH, 2020, 8
  • [38] A prediction model for the floor impact sound using random forest regression
    HIRAKAWA, Susumu
    HIRAMITSU, Atsuo
    [J]. Journal of Environmental Engineering (Japan), 2021, 86 (779): : 25 - 33
  • [39] Trend prediction of irrigation area using improved random forest regression
    Wang, Maofa
    Huang, Hongliang
    Gao, Guangda
    Tang, Weiyu
    [J]. IRRIGATION AND DRAINAGE, 2022, 71 (04) : 1011 - 1023
  • [40] Prediction Analysis of Crop and Their Futuristic Yields Using Random Forest Regression
    Ramisetty, Uma Maheswari
    Kumar Gundavarapu, Venkata Nagesh
    Rajender, R.
    Ramirez, Isaac Segovia
    Garcia Marquez, Fausto Pedro
    [J]. IOT AND DATA SCIENCE IN ENGINEERING MANAGEMENT, 2023, 160 : 280 - 285