Identifying major drivers of daily streamflow from large-scale atmospheric circulation with machine learning

被引：23

作者：

Hagen, Jenny Sjastad ^{[1
,2
]}

Leblois, Etienne ^{[3
]}

Lawrence, Deborah ^{[4
]}

Solomatine, Dimitri ^{[5
]}

Sorteberg, Asgeir ^{[1
,2
]}

机构：

[1] Univ Bergen, Geophys Inst, Allegaten 70, N-5007 Bergen, Norway

[2] Univ Bergen, Bjerknes Ctr Climate Res, Jahnebakken 5, N-5007 Bergen, Norway

[3] French Natl Inst Agr Food & Environm INRAE, Riverly Lyon Res Unit, 5 Rue Doua, F-69625 Villeurbanne, France

[4] Norwegian Water Resources & Energy Directorate NV, Middelthuns Gate 29, N-0368 Oslo, Norway

[5] IHE Delft Inst Water Educ, Westvest 7, NL-2611 AX Delft, Netherlands

来源：

JOURNAL OF HYDROLOGY | 2021年 / 596卷

关键词：

Direct downscaling; Discharge; Automated feature extraction; ERA5; ERA-Interim; SUPPORT-VECTOR; CLIMATE; PERFORMANCE; MODELS; FLOODS;

D O I：

10.1016/j.jhydrol.2021.126086

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Previous studies linking large-scale atmospheric circulation and river flow with traditional machine learning techniques have predominantly explored monthly, seasonal or annual streamflow modelling for applications in direct downscaling or hydrological climate-impact studies. This paper identifies major drivers of daily streamflow from large-scale atmospheric circulation using two reanalysis datasets for six catchments in Norway representing various Koppen-Geiger climate types and flood-generating processes. A nested loop of roughly pruned random forests is used for feature extraction, demonstrating the potential for automated retrieval of physically consistent and interpretable input variables. Random forest (RF), support vector machine (SVM) for regression and multilayer perceptron (MLP) neural networks are compared to multiple-linear regression to assess the role of model complexity in utilizing the identified major drivers to reconstruct streamflow. The machine learning models were trained on 31 years of aggregated atmospheric data with distinct moving windows for each catchment, reflecting catchment-specific forcing-response relationships between the atmosphere and the rivers. The results show that accuracy improves to some extent with model complexity. In all but the smallest, rainfall-driven catchment, the most complex model, MLP, gives a Nash-Sutcliffe Efficiency (NSE) ranging from 0.71 to 0.81 on testing data spanning five years. The poorer performance by all models in the smallest catchment is discussed in relation to catchment characteristics, sub-grid topography and local variability. The intra-model differences are also viewed in relation to the consistency between the automatically retrieved feature selections from the two reanalysis datasets. This study provides a benchmark for future development of deep learning models for direct downscaling from large-scale atmospheric variables to daily streamflow in Norway.

引用

页数：22

共 50 条

[41] Large-Scale Machine Learning and Neuroimaging in Psychiatry
Thompson, Paul
[J]. BIOLOGICAL PSYCHIATRY, 2018, 83 (09) : S51 - S51
[42] Coding for Large-Scale Distributed Machine Learning
Xiao, Ming
Skoglund, Mikael
[J]. ENTROPY, 2022, 24 (09)
[43] Large-scale Machine Learning over Graphs
Yang, Yiming
[J]. PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 9 - 9
[44] Robust Large-Scale Machine Learning in the Cloud
Rendle, Steffen
Fetterly, Dennis
Shekita, Eugene J.
Su, Bor-yiing
[J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1125 - 1134
[45] Resource Elasticity for Large-Scale Machine Learning
Huang, Botong
Boehm, Matthias
Tian, Yuanyuan
Reinwald, Berthold
Tatikonda, Shirish
Reiss, Frederick R.
[J]. SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 137 - 152
[46] Optimization Methods for Large-Scale Machine Learning
Bottou, Leon
Curtis, Frank E.
Nocedal, Jorge
[J]. SIAM REVIEW, 2018, 60 (02) : 223 - 311
[47] TensorFlow: A system for large-scale machine learning
Abadi, Martin
Barham, Paul
Chen, Jianmin
Chen, Zhifeng
Davis, Andy
Dean, Jeffrey
Devin, Matthieu
Ghemawat, Sanjay
Irving, Geoffrey
Isard, Michael
Kudlur, Manjunath
Levenberg, Josh
Monga, Rajat
Moore, Sherry
Murray, Derek G.
Steiner, Benoit
Tucker, Paul
Vasudevan, Vijay
Warden, Pete
Wicke, Martin
Yu, Yuan
Zheng, Xiaoqiang
[J]. PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, 2016, : 265 - 283
[48] Salinity and streamflow variability in the Mid-Atlantic region of the United States and its relationship with large-scale atmospheric circulation patterns
Schulte, Justin A.
Najjar, Raymond G.
Lee, Sukyoung
[J]. JOURNAL OF HYDROLOGY, 2017, 550 : 65 - 79
[49] Large-Scale Atmospheric Drivers of Snowfall Over Thwaites Glacier, Antarctica
Maclennan, Michelle L.
Lenaerts, Jan T. M.
[J]. GEOPHYSICAL RESEARCH LETTERS, 2021, 48 (17)
[50] The utility of daily large-scale climate data in the assessment of climate change impacts on daily streamflow in California
Maurer, E. P.
Hidalgo, H. G.
Das, T.
Dettinger, M. D.
Cayan, D. R.
[J]. HYDROLOGY AND EARTH SYSTEM SCIENCES, 2010, 14 (06) : 1125 - 1138

← 1 2 3 4 5 →