Identifying major drivers of daily streamflow from large-scale atmospheric circulation with machine learning

被引:23
|
作者
Hagen, Jenny Sjastad [1 ,2 ]
Leblois, Etienne [3 ]
Lawrence, Deborah [4 ]
Solomatine, Dimitri [5 ]
Sorteberg, Asgeir [1 ,2 ]
机构
[1] Univ Bergen, Geophys Inst, Allegaten 70, N-5007 Bergen, Norway
[2] Univ Bergen, Bjerknes Ctr Climate Res, Jahnebakken 5, N-5007 Bergen, Norway
[3] French Natl Inst Agr Food & Environm INRAE, Riverly Lyon Res Unit, 5 Rue Doua, F-69625 Villeurbanne, France
[4] Norwegian Water Resources & Energy Directorate NV, Middelthuns Gate 29, N-0368 Oslo, Norway
[5] IHE Delft Inst Water Educ, Westvest 7, NL-2611 AX Delft, Netherlands
关键词
Direct downscaling; Discharge; Automated feature extraction; ERA5; ERA-Interim; SUPPORT-VECTOR; CLIMATE; PERFORMANCE; MODELS; FLOODS;
D O I
10.1016/j.jhydrol.2021.126086
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Previous studies linking large-scale atmospheric circulation and river flow with traditional machine learning techniques have predominantly explored monthly, seasonal or annual streamflow modelling for applications in direct downscaling or hydrological climate-impact studies. This paper identifies major drivers of daily streamflow from large-scale atmospheric circulation using two reanalysis datasets for six catchments in Norway representing various Koppen-Geiger climate types and flood-generating processes. A nested loop of roughly pruned random forests is used for feature extraction, demonstrating the potential for automated retrieval of physically consistent and interpretable input variables. Random forest (RF), support vector machine (SVM) for regression and multilayer perceptron (MLP) neural networks are compared to multiple-linear regression to assess the role of model complexity in utilizing the identified major drivers to reconstruct streamflow. The machine learning models were trained on 31 years of aggregated atmospheric data with distinct moving windows for each catchment, reflecting catchment-specific forcing-response relationships between the atmosphere and the rivers. The results show that accuracy improves to some extent with model complexity. In all but the smallest, rainfall-driven catchment, the most complex model, MLP, gives a Nash-Sutcliffe Efficiency (NSE) ranging from 0.71 to 0.81 on testing data spanning five years. The poorer performance by all models in the smallest catchment is discussed in relation to catchment characteristics, sub-grid topography and local variability. The intra-model differences are also viewed in relation to the consistency between the automatically retrieved feature selections from the two reanalysis datasets. This study provides a benchmark for future development of deep learning models for direct downscaling from large-scale atmospheric variables to daily streamflow in Norway.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Large-Scale Machine Learning and Neuroimaging in Psychiatry
    Thompson, Paul
    [J]. BIOLOGICAL PSYCHIATRY, 2018, 83 (09) : S51 - S51
  • [42] Coding for Large-Scale Distributed Machine Learning
    Xiao, Ming
    Skoglund, Mikael
    [J]. ENTROPY, 2022, 24 (09)
  • [43] Large-scale Machine Learning over Graphs
    Yang, Yiming
    [J]. PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 9 - 9
  • [44] Robust Large-Scale Machine Learning in the Cloud
    Rendle, Steffen
    Fetterly, Dennis
    Shekita, Eugene J.
    Su, Bor-yiing
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1125 - 1134
  • [45] Resource Elasticity for Large-Scale Machine Learning
    Huang, Botong
    Boehm, Matthias
    Tian, Yuanyuan
    Reinwald, Berthold
    Tatikonda, Shirish
    Reiss, Frederick R.
    [J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 137 - 152
  • [46] Optimization Methods for Large-Scale Machine Learning
    Bottou, Leon
    Curtis, Frank E.
    Nocedal, Jorge
    [J]. SIAM REVIEW, 2018, 60 (02) : 223 - 311
  • [47] TensorFlow: A system for large-scale machine learning
    Abadi, Martin
    Barham, Paul
    Chen, Jianmin
    Chen, Zhifeng
    Davis, Andy
    Dean, Jeffrey
    Devin, Matthieu
    Ghemawat, Sanjay
    Irving, Geoffrey
    Isard, Michael
    Kudlur, Manjunath
    Levenberg, Josh
    Monga, Rajat
    Moore, Sherry
    Murray, Derek G.
    Steiner, Benoit
    Tucker, Paul
    Vasudevan, Vijay
    Warden, Pete
    Wicke, Martin
    Yu, Yuan
    Zheng, Xiaoqiang
    [J]. PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, 2016, : 265 - 283
  • [48] Salinity and streamflow variability in the Mid-Atlantic region of the United States and its relationship with large-scale atmospheric circulation patterns
    Schulte, Justin A.
    Najjar, Raymond G.
    Lee, Sukyoung
    [J]. JOURNAL OF HYDROLOGY, 2017, 550 : 65 - 79
  • [49] Large-Scale Atmospheric Drivers of Snowfall Over Thwaites Glacier, Antarctica
    Maclennan, Michelle L.
    Lenaerts, Jan T. M.
    [J]. GEOPHYSICAL RESEARCH LETTERS, 2021, 48 (17)
  • [50] The utility of daily large-scale climate data in the assessment of climate change impacts on daily streamflow in California
    Maurer, E. P.
    Hidalgo, H. G.
    Das, T.
    Dettinger, M. D.
    Cayan, D. R.
    [J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2010, 14 (06) : 1125 - 1138