Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms

被引:0
|
作者
Hristos Tyralis
Georgia Papacharalampous
Andreas Langousis
机构
[1] National Technical University of Athens,Department of Water Resources and Environmental Engineering, School of Civil Engineering
[2] Elefsina Air Base,Air Force Support Command, Hellenic Air Force
[3] University of Patras,Department of Civil Engineering, School of Engineering
来源
关键词
Combining forecasts; Ensemble learning; Hydrology; Stacking;
D O I
暂无
中图分类号
学科分类号
摘要
Daily streamflow forecasting through data-driven approaches is traditionally performed using a single machine learning algorithm. Existing applications are mostly restricted to examination of few case studies, not allowing accurate assessment of the predictive performance of the algorithms involved. Here, we propose super learning (a type of ensemble learning) by combining 10 machine learning algorithms. We apply the proposed algorithm in one-step-ahead forecasting mode. For the application, we exploit a big dataset consisting of 10-year long time series of daily streamflow, precipitation and temperature from 511 basins. The super ensemble learner improves over the performance of the linear regression algorithm by 20.06%, outperforming the “hard to beat in practice” equal weight combiner. The latter improves over the performance of the linear regression algorithm by 19.21%. The best performing individual machine learning algorithm is neural networks, which improves over the performance of the linear regression algorithm by 16.73%, followed by extremely randomized trees (16.40%), XGBoost (15.92%), loess (15.36%), random forests (12.75%), polyMARS (12.36%), MARS (4.74%), lasso (0.11%) and support vector regression (− 0.45%). Furthermore, the super ensemble learner outperforms exponential smoothing and autoregressive integrated moving average (ARIMA). These latter two models improve over the performance of the linear regression algorithm by 13.89% and 8.77%, respectively. Based on the obtained large-scale results, we propose super ensemble learning for daily streamflow forecasting.
引用
收藏
页码:3053 / 3068
页数:15
相关论文
共 50 条
  • [1] Super ensemble learning for daily streamflow forecasting: large-scale demonstration and comparison with multiple machine learning algorithms
    Tyralis, Hristos
    Papacharalampous, Georgia
    Langousis, Andreas
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (08): : 3053 - 3068
  • [2] A Large-Scale Ensemble Learning Framework for Demand Forecasting
    Park, Young-Jin
    Kim, Donghyun
    Odermatt, Frederic
    Lee, Juho
    Kim, Kyung-Min
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 378 - 387
  • [3] Comparison of four algorithms based on machine learning for cooling load forecasting of large-scale shopping mall
    Zhou Xuan
    Fan Zhubing
    Liang Liequan
    Yan Junwei
    Pan Dongmei
    [J]. PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON APPLIED ENERGY, 2017, 142 : 1799 - 1804
  • [4] Identifying major drivers of daily streamflow from large-scale atmospheric circulation with machine learning
    Hagen, Jenny Sjastad
    Leblois, Etienne
    Lawrence, Deborah
    Solomatine, Dimitri
    Sorteberg, Asgeir
    [J]. JOURNAL OF HYDROLOGY, 2021, 596
  • [5] Machine learning for large-scale crop yield forecasting
    Paudel, Dilli
    Boogaard, Hendrik
    de Wit, Allard
    Janssen, Sander
    Osinga, Sjoukje
    Pylianidis, Christos
    Athanasiadis, Ioannis N.
    [J]. AGRICULTURAL SYSTEMS, 2021, 187
  • [6] Using Optimized Deep Learning to Predict Daily Streamflow: A Comparison to Common Machine Learning Algorithms
    Khabat Khosravi
    Ali Golkarian
    John P. Tiefenbacher
    [J]. Water Resources Management, 2022, 36 : 699 - 716
  • [7] Using Optimized Deep Learning to Predict Daily Streamflow: A Comparison to Common Machine Learning Algorithms
    Khosravi, Khabat
    Golkarian, Ali
    Tiefenbacher, John P.
    [J]. WATER RESOURCES MANAGEMENT, 2022, 36 (02) : 699 - 716
  • [8] Aggregation models in ensemble learning: A large-scale comparison
    Campagner, Andrea
    Ciucci, Davide
    Cabitza, Federico
    [J]. INFORMATION FUSION, 2023, 90 : 241 - 252
  • [9] Large-scale comparison of machine learning algorithms for target prediction of natural products
    Liang, Lu
    Liu, Ye
    Kang, Bo
    Wang, Ru
    Sun, Meng-Yu
    Wu, Qi
    Meng, Xiang-Fei
    Lin, Jian-Ping
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [10] Forecasting daily solar radiation: An evaluation and comparison of machine learning algorithms
    Bin Nadeem, Talha
    Ali, Syed Usama
    Asif, Muhammad
    Suberi, Hari Kumar
    [J]. AIP ADVANCES, 2024, 14 (07)