Long-term stability and generalization of observationally-constrained stochastic data-driven models for geophysical turbulence

被引:0
|
作者
Chattopadhyay, Ashesh [1 ]
Pathak, Jaideep [2 ]
Nabizadeh, Ebrahim [1 ]
Bhimji, Wahid [3 ]
Hassanzadeh, Pedram [1 ]
机构
[1] Rice Univ, Dept Mech Engn, Houston, TX 77005 USA
[2] NVIDIA, Santa Clara, CA USA
[3] Lawrence Berkeley Natl Lab, Berkeley, CA USA
来源
关键词
Data-driven climate model; long-term stability; transfer learning; variational autoencoder; WEATHER;
D O I
10.1017/eds.2022.30
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Recent years have seen a surge in interest in building deep learning-based fully data-driven models for weather prediction. Such deep learning models, if trained on observations can mitigate certain biases in current state-of-the-art weather models, some of which stem from inaccurate representation of subgrid-scale processes. However, these data-driven models, being over-parameterized, require a lot of training data which may not be available from reanalysis (observational data) products. Moreover, an accurate, noise-free, initial condition to start forecasting with a data-driven weather model is not available in realistic scenarios. Finally, deterministic data-driven forecasting models suffer from issues with long-term stability and unphysical climate drift, which makes these data-driven models unsuitable for computing climate statistics. Given these challenges, previous studies have tried to pre-train deep learning-based weather forecasting models on a large amount of imperfect long-term climate model simulations and then re-train them on available observational data. In this article, we propose a convolutional variational autoencoder (VAE)-based stochastic data-driven model that is pre-trained on an imperfect climate model simulation from a two-layer quasi-geostrophic flow and re-trained, using transfer learning, on a small number of noisy observations from a perfect simulation. This re-trained model then performs stochastic forecasting with a noisy initial condition sampled from the perfect simulation. We show that our ensemble-based stochastic data-driven model outperforms a baseline deterministic encoder-decoder-based convolutional model in terms of short-term skills, while remaining stable for long-term climate simulations yielding accurate climatology. Impact Statement A stochastic VAE-based data-driven model pre-trained on imperfect climate simulations and re-trained with transfer learning, on a limited number of observations, leads to accurate short-term weather forecasting along with long-term stable non-drifting climate.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Assessing the potential of data-driven models for estimation of long-term monthly temperatures
    Mehdizadeh, Saeid
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 144 : 114 - 125
  • [2] Data-driven prediction of long-term deterioration of RC bridges
    Alonso Medina, Pablo
    Leon Gonzalez, Francisco Javier
    Todisco, Leonardo
    [J]. CONSTRUCTION AND BUILDING MATERIALS, 2022, 317
  • [3] Data-driven modeling for long-term electricity price forecasting
    Gabrielli, Paolo
    Wuthrich, Moritz
    Blume, Steffen
    Sansavini, Giovanni
    [J]. ENERGY, 2022, 244
  • [4] A Data-Driven Approach for Long-Term Building Energy Demand Prediction
    Wang, Lufan
    El-Gohary, Nora M.
    [J]. CONSTRUCTION RESEARCH CONGRESS 2020: COMPUTER APPLICATIONS, 2020, : 1165 - 1173
  • [5] Data-Driven Robust Barrier Functions for Safe, Long-Term Operation
    Emam, Yousef
    Glotfelter, Paul
    Wilson, Sean
    Notomista, Gennaro
    Egerstedt, Magnus
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2022, 38 (03) : 1671 - 1685
  • [6] Study on the evolution law of performance of mid- to long-term streamflow forecasting based on data-driven models
    Fang, Wei
    Zhou, Jian-zhong
    Jia, Ben-Jun
    Gu, Lei
    Xu, Zhan-xing
    [J]. SUSTAINABLE CITIES AND SOCIETY, 2023, 88
  • [7] Combination of Multiple Data-Driven Models for Long-Term Monthly Runoff Predictions Based on Bayesian Model Averaging
    Huaping Huang
    Zhongmin Liang
    Binquan Li
    Dong Wang
    Yiming Hu
    Yujie Li
    [J]. Water Resources Management, 2019, 33 : 3321 - 3338
  • [8] Combination of Multiple Data-Driven Models for Long-Term Monthly Runoff Predictions Based on Bayesian Model Averaging
    Huang, Huaping
    Liang, Zhongmin
    Li, Binquan
    Wang, Dong
    Hu, Yiming
    Li, Yujie
    [J]. WATER RESOURCES MANAGEMENT, 2019, 33 (09) : 3321 - 3338
  • [9] A Study of Long-Term fMRI Reproducibility Using Data-Driven Analysis Methods
    Song, Xiaomu
    Panych, Lawrence P.
    Chou, Ying-Hui
    Chen, Nan-Kuei
    [J]. INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2014, 24 (04) : 339 - 349
  • [10] A Data-driven Long-Term Dynamic Rating Estimating Method for Power Transformers
    Dong, Ming
    [J]. IEEE TRANSACTIONS ON POWER DELIVERY, 2021, 36 (02) : 686 - 697