Hyperparameter tuning of supervised bagging ensemble machine learning model using Bayesian optimization for estimating stormwater quality

被引:0
|
作者
Mohammadreza Moeini
机构
[1] University of Illinois at Chicago,Department of Civil, Materials, and Environmental Engineering
来源
关键词
Bayesian optimization; Machine learning; Ensemble modeling; Stormwater quality; Urban watershed;
D O I
暂无
中图分类号
学科分类号
摘要
Physically based models (PBMs), including stormwater management model (SWMM), require a significant amount of in situ data and expertise to predict water quality in urban watersheds. In recent years, data-driven models have been increasingly used as an alternative for the prediction of pollutant concentrations. Supervised machine learning (ML) models have been used for estimating stormwater quality parameters. However, optimizing the structure of such ML models has rarely been considered. This study aims to comprehensively evaluate the optimization of the supervised ensemble bagging ML model for forecasting stormwater quality using an ML-based optimization method called Bayesian optimization (BO). To that end, a bagging ensemble model, namely random forest (RF), was first developed for estimating total suspended solids (TSS) concentration in urban watersheds. Eleven factors, including drainage area, land-use types, impervious area, rainfall depth, the volume of runoff, and antecedent dry days, were implemented as predictive features in the model, and their data were acquired from the National Stormwater Quality Database (NSQD). Values for the number of basic estimators, the number of basic selected features for developing basic estimators, subsamples, and the maximum depth of basic learners were optimized using BO. A sensitivity analysis was done on the ML model and the BO parameters, including acquisition function, number of initial points, and realizations. Results indicated that the accuracy of the RF model depends on all mentioned RF parameters. The performance of the best-developed RF model was satisfactory in both the training and the testing steps. This model obtained the R2 values of 0.955 and 0.915 for the training and testing step, respectively. The study demonstrated the potential of a combination of the RF models and BO for accurately predicting stormwater quality parameters.
引用
收藏
相关论文
共 50 条
  • [41] Enhancing Prediction Performance of Landslide Susceptibility Model Using Hybrid Machine Learning Approach of Bagging Ensemble and Logistic Model Tree
    Xuan Luan Truong
    Mitamura, Muneki
    Kono, Yasuyuki
    Raghavan, Venkatesh
    Yonezawa, Go
    Xuan Quang Truong
    Thi Hang Do
    Dieu Tien Bui
    Lee, Saro
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (07):
  • [42] Surrogate Model Based on an MLP Neural Network and Bayesian Hyperparameter Tuning for Ship Hull Form Optimization
    Zhang, Yi
    Ma, Ning
    Gu, Xiechong
    Shi, QiQi
    [J]. INTERNATIONAL JOURNAL OF OFFSHORE AND POLAR ENGINEERING, 2023, 33 (02) : 184 - 195
  • [43] Finding Location Visiting Preference from Personal Features with Ensemble Machine Learning Techniques and Hyperparameter Optimization
    Kim, Young Myung
    Song, Ha Yoon
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [44] Machine Learning-based Test Case Prioritization using Hyperparameter Optimization
    Khan, Md Asif
    Azim, Akramul
    Liscano, Ramiro
    Smith, Kevin
    Tauseef, Qasim
    Seferi, Gkerta
    Chang, Yee-Kang
    [J]. PROCEEDINGS OF THE 2024 IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATION OF SOFTWARE TEST, AST 2024, 2024, : 125 - 135
  • [45] Credit Default Risk Analysis Using Machine Learning Algorithms with Hyperparameter Optimization
    Inga, Juan
    Sacoto-Cabrera, Erwin
    [J]. INTELLIGENT TECHNOLOGIES: DESIGN AND APPLICATIONS FOR SOCIETY, CITIS 2022, 2023, 607 : 81 - 95
  • [46] Efficient tuning of Individual Pitch Control: A Bayesian Optimization Machine Learning approach
    Mulders, S. P.
    Pamososuryo, A. K.
    van Wingerden, J. W.
    [J]. SCIENCE OF MAKING TORQUE FROM WIND (TORQUE 2020), PTS 1-5, 2020, 1618
  • [47] Bagging-based positive-unlabeled learning algorithm with Bayesian hyperparameter optimization for three-dimensional mineral potential mapping
    Zhang, Zhiqiang
    Wang, Gongwen
    Liu, Chong
    Cheng, Lizhen
    Sha, Deming
    [J]. COMPUTERS & GEOSCIENCES, 2021, 154
  • [48] Building a Classification Model based on Feature Engineering for the Prediction of Wine Quality by Employing Supervised Machine Learning and Ensemble Learning Techniques
    Nandan, Mauparna
    Gupta, Harsh Raj
    Mondal, Moutusi
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL & COMMUNICATION ENGINEERING, ICCECE, 2023,
  • [49] Network Intrusion Detection and Prevention System Using Hybrid Machine Learning with Supervised Ensemble Stacking Model
    Mills, Godfrey A.
    Acquah, Daniel K.
    Sowah, Robert A.
    [J]. Journal of Computer Networks and Communications, 2024, 2024
  • [50] High Accuracy Predictive Model on Breast Cancer Using Ensemble Approach of Supervised Machine Learning Algorithms
    Kaul, Chaitanya
    Sharma, Neeraj
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 71 - +