Enhancing the streamflow simulation of a process-based hydrological model using machine learning and multi-source data

被引:0
|
作者
Lei, Huajin [1 ,2 ,3 ]
Li, Hongyi [4 ]
Hu, Wanpin [5 ]
机构
[1] Xihua Univ, Sch Energy & Power Engn, Chengdu 610039, Peoples R China
[2] Xihua Univ, Key Lab Fluid Machinery & Engn, Chengdu 610039, Sichuan, Peoples R China
[3] Sichuan Univ, Coll Water Resource & Hydropower, Chengdu 610065, Peoples R China
[4] Chinese Acad Sci, Northwest Inst Ecoenvironm & Resources, Lanzhou 730070, Peoples R China
[5] Sichuan Inst Land Sci & Technol, Dept Nat Resources Sichuan Prov, Chengdu 610065, Peoples R China
关键词
Streamflow simulation; Process-based hydrological model; Machine learning; Hybrid modelling; Jialing River basin; ARTIFICIAL NEURAL-NETWORKS; VARIABLE SELECTION; BTOP MODEL; REGRESSION; UNCERTAINTY; EVAPORATION; PREDICTION; TOPMODEL;
D O I
10.1016/j.ecoinf.2024.102755
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Streamflow simulation is crucial for flood mitigation, ecological protection, and water resource planning. Process-based hydrological models and machine learning algorithms are the mainstream tools for streamflow simulation. However, their inherent limitations, such as time-consuming and large data requirements, make achieving high-precision simulations challenging. This study developed a hybrid approach to simultaneously improve the accuracy and computational efficiency of streamflow simulation, which integrates Block-wise use of the TOPMODEL (BTOP) model into the eXtreme Gradient Boosting (XGBoost), i.e., BTOP_XGB. In this approach, BTOP generates simulated streamflow using the Latin hypercube sampling algorithm instead of the time-consuming calibration algorithms to reduce computational costs. Then, XGBoost combines BTOP simulated streamflow with multi-source data to reduce simulation errors. In which, serval input variable selection algorithms are employed to choose relevant inputs and remove redundant information for model. The hybrid approach is validated and compared with a standalone model at three hydrological stations in the Jialing River basin, China. The results show that the performance of BTOP_XGB is significantly better than the BTOP and XGBoost models. The NSE of BTOP_XGB at Beibei, Xiaoheba, and Luoduxi stations increases by 54%, 21%, and 83%, respectively. Meanwhile, the computational time of BTOP_XGB is saved by >90% compared to the original calibrated BTOP. BTOP_XGB is less affected by parameter sample sizes and data amounts, demonstrating the robustness of the hybrid model. This study simplifies the complexity of the hydrological model and enhances the stability of machine learning, jointly improving the reliability of streamflow simulation. The hybrid approach provides a potential shortcut for streamflow simulation over basins with large areas or limited observed data.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Emulating process-based water quality modelling in water source reservoirs using machine learning
    Mohammed, Hadi
    Tornyeviadzi, Hoese Michel
    Seidu, Razak
    JOURNAL OF HYDROLOGY, 2022, 609
  • [32] Simulation Credibility Evaluation Based on Multi-source Data Fusion
    Zhou, Yuchen
    Fang, Ke
    Ma, Ping
    Yang, Ming
    METHODS AND APPLICATIONS FOR MODELING AND SIMULATION OF COMPLEX SYSTEMS, 2018, 946 : 18 - 31
  • [33] Estimation of Nitrogen Content in Winter Wheat Based on Multi-Source Data Fusion and Machine Learning
    Ding, Fan
    Li, Changchun
    Zhai, Weiguang
    Fei, Shuaipeng
    Cheng, Qian
    Chen, Zhen
    AGRICULTURE-BASEL, 2022, 12 (11):
  • [34] Process based calibration of a continental-scale hydrological model using soil moisture and streamflow data
    Bajracharya, Ajay Ratna
    Ahmed, Mohamed Ismaiel
    Stadnyk, Tricia
    Asadzadeh, Masoud
    JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2023, 47
  • [35] Integrated UAV-Based Multi-Source Data for Predicting Maize Grain Yield Using Machine Learning Approaches
    Guo, Yahui
    Zhang, Xuan
    Chen, Shouzhi
    Wang, Hanxi
    Jayavelu, Senthilnath
    Cammarano, Davide
    Fu, Yongshuo
    REMOTE SENSING, 2022, 14 (24)
  • [36] Multi-source precipitation estimation using machine learning: Clarification and benchmarking
    Xu, Yue
    Tang, Guoqiang
    Li, Lingjie
    Wan, Wei
    JOURNAL OF HYDROLOGY, 2024, 635
  • [37] Multi-Source Cyber-Attacks Detection using Machine Learning
    Taheri, Sona
    Gondal, Iqbal
    Bagirov, Adil
    Harkness, Greg
    Brown, Simon
    Chi, CHihung
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2019, : 1167 - 1172
  • [38] A deep learning model based on multi-source data for daily tourist volume forecasting
    Han, Wenjie
    Li, Yong
    Li, Yunpeng
    Huang, Tao
    CURRENT ISSUES IN TOURISM, 2024, 27 (05) : 768 - 786
  • [39] Enhancing Streamflow Prediction Physically Consistently Using Process-Based Modeling and Domain Knowledge: A Review
    Yifru, Bisrat Ayalew
    Lim, Kyoung Jae
    Lee, Seoro
    SUSTAINABILITY, 2024, 16 (04)
  • [40] Measuring Housing Vitality from Multi-Source Big Data and Machine Learning
    Zhou, Yang
    Xue, Lirong
    Shi, Zhengyu
    Wu, Libo
    Fan, Jianqing
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1045 - 1059