River Water Salinity Prediction Using Hybrid Machine Learning Models

Cited by: 63
Authors
Melesse, Assefa M. [1]
Khosravi, Khabat [2]
Tiefenbacher, John P. [3]
Heddam, Salim [4]
Kim, Sungwon [5]
Mosavi, Amir [6, 7, 8, 9]
Pham, Binh Thai [10]
Affiliations
[1] Florida Int Univ, Dept Earth & Environm, Miami, FL 33199 USA
[2] Sari Agr & Nat Resources Univ, Dept Watershed Management, Sari 4818168984, Iran
[3] Texas State Univ, Dept Geog, San Marcos, TX 78666 USA
[4] Univ 20 Aout 1955, Lab Res Biodivers Interact Ecosyst & Biotechnol, Route El Hadaik,BP 26, Skikda 21000, Algeria
[5] Dongyang Univ, Dept Railroad Construct & Safety Engn, Yeongju 36040, South Korea
[6] Tech Univ Dresden, Fac Civil Engn, D-01069 Dresden, Germany
[7] Norwegian Univ Life Sci, Sch Business & Econ, N-1430 As, Norway
[8] Thuringian Inst Sustainabil & Climate Protect, D-07743 Jena, Germany
[9] Obuda Univ, Inst Automat, H-1034 Budapest, Hungary
[10] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
Keywords
water salinity; machine learning; bagging; random forest; random subspace; data science; hydrological model; big data; hydroinformatics; electrical conductivity; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINES; RANDOM SUBSPACE ENSEMBLES; FUZZY INFERENCE SYSTEM; DATA MINING MODELS; DISSOLVED-OXYGEN; ELECTRICAL-CONDUCTIVITY; REGRESSION; GROUNDWATER; PERFORMANCE;
DOI
10.3390/w12102951
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Electrical conductivity (EC), one of the most widely used indices for water quality assessment, has been applied to predict the salinity of the Babol-Rood River, the greatest source of irrigation water in northern Iran. This study uses two individual algorithms, M5 Prime (M5P) and random forest (RF), and eight novel hybrid algorithms (bagging-M5P, bagging-RF, random subspace (RS)-M5P, RS-RF, random committee (RC)-M5P, RC-RF, additive regression (AR)-M5P, and AR-RF) to predict EC. Thirty-six years of observations collected by the Mazandaran Regional Water Authority were randomly divided into two sets: 70% from the period 1980 to 2008 was used as model-training data and 30% from 2009 to 2016 was used as testing data to validate the models. Several water quality variables (pH, HCO₃⁻, Cl⁻, SO₄²⁻, Na⁺, Mg²⁺, Ca²⁺, river discharge (Q), and total dissolved solids (TDS)) were modeling inputs. Using EC and the correlation coefficients (CC) of the water quality variables, a set of nine input combinations were established. TDS, the most effective input variable, had the highest EC-CC (r = 0.91), and it was also determined to be the most important input variable among the input combinations. All models were trained and each model's prediction power was evaluated with the testing data. Several quantitative criteria and visual comparisons were used to evaluate modeling capabilities. Results indicate that, in most cases, hybrid algorithms enhance individual algorithms' predictive powers. The AR algorithm enhanced both M5P and RF predictions better than bagging, RS, and RC. M5P performed better than RF. Further, AR-M5P outperformed all other algorithms (R² = 0.995, RMSE = 8.90 μS/cm, MAE = 6.20 μS/cm, NSE = 0.994 and PBIAS = -0.042). The hybridization of machine learning methods has significantly improved model performance to capture maximum salinity values, which is essential in water resource management.
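The abstract reports RMSE, MAE, NSE, and PBIAS without defining them. As a minimal sketch, the functions below compute these criteria using their commonly cited definitions; this record does not state which formulas the authors used, so the definitions are assumptions. In particular, the PBIAS sign convention here (positive when the model underestimates) is one of several in use. The sample EC values are hypothetical and are not taken from the paper.

```python
import math

def rmse(obs, sim):
    """Root mean square error, in the units of the data."""
    return math.sqrt(sum((o - s) ** 2 for o, s in zip(obs, sim)) / len(obs))

def mae(obs, sim):
    """Mean absolute error, in the units of the data."""
    return sum(abs(o - s) for o, s in zip(obs, sim)) / len(obs)

def nse(obs, sim):
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(obs) / len(obs)
    residual = sum((o - s) ** 2 for o, s in zip(obs, sim))
    variance = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - residual / variance

def pbias(obs, sim):
    """Percent bias (assumed convention: 100 * sum(obs - sim) / sum(obs))."""
    return 100.0 * sum(o - s for o, s in zip(obs, sim)) / sum(obs)

# Hypothetical EC values in μS/cm, made up for illustration only.
observed = [400.0, 500.0, 600.0, 700.0]
predicted = [410.0, 490.0, 610.0, 690.0]

print("RMSE :", rmse(observed, predicted))
print("MAE  :", mae(observed, predicted))
print("NSE  :", nse(observed, predicted))
print("PBIAS:", pbias(observed, predicted))
```

RMSE and MAE carry the units of EC, while NSE is dimensionless and PBIAS is a percentage, which is why the abstract attaches μS/cm only to the first two.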
Pages: 1-21
Number of pages: 21