Derivation and analysis of parallel-in-time neural ordinary differential equations

被引:0
|
作者
E. Lorin
机构
[1] Université de Montréal,Centre de Recherches Mathématiques,
[2] Carleton University,School of Mathematics and Statistics
关键词
Residual Neural Network; Neural Ordinary Differential Equations; Parareal method; Parallelism-in-time; 65Y05; 65L20; 68T01; 68T20;
D O I
暂无
中图分类号
学科分类号
摘要
The introduction in 2015 of Residual Neural Networks (RNN) and ResNET allowed for outstanding improvements of the performance of learning algorithms for evolution problems containing a “large” number of layers. Continuous-depth RNN-like models called Neural Ordinary Differential Equations (NODE) were then introduced in 2019. The latter have a constant memory cost, and avoid the a priori specification of the number of hidden layers. In this paper, we derive and analyze a parallel (-in-parameter and time) version of the NODE, which potentially allows for a more efficient implementation than a standard/naive parallelization of NODEs with respect to the parameters only. We expect this approach to be relevant whenever we have access to a very large number of processors, or when we are dealing with high dimensional ODE systems. Moreover, when using implicit ODE solvers, solutions to linear systems with up to cubic complexity are then required for solving nonlinear systems using for instance Newton’s algorithm; as the proposed approach allows to reduce the overall number of time-steps thanks to an iterative increase of the accuracy order of the ODE system solvers, it then reduces the number of linear systems to solve, hence benefiting from a scaling effect.
引用
收藏
页码:1035 / 1059
页数:24
相关论文
共 50 条
  • [41] Parallel-in-time simulation of the unsteady Navier-Stokes equations for incompressible flow
    Trindade, JMF
    Pereira, JCF
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN FLUIDS, 2004, 45 (10) : 1123 - 1136
  • [42] Parallel-in-time relaxed Newton method for transient stability analysis
    Zhengzhou Univ, Zhengzhou City, China
    IEE Proc Gener Transm Distrib, 2 (155-159):
  • [43] A simplified derivation of the ordinary differential equations for the free surface Green functions
    Newman, J. N.
    APPLIED OCEAN RESEARCH, 2020, 94
  • [44] Parallel-in-time simulation of biofluids
    Liu, Weifan
    Rostami, Minghao W.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2022, 464
  • [45] Predict globally, correct locally: Parallel-in-time optimization of neural networks
    Parpas, Panos
    Muir, Corey
    AUTOMATICA, 2025, 171
  • [46] Parallel-in-time optimization of induction motors
    Hahne, Jens
    Polenz, Bjoern
    Kulchytska-Ruchka, Iryna
    Friedhoff, Stephanie
    Ulbrich, Stefan
    Schoeps, Sebastian
    JOURNAL OF MATHEMATICS IN INDUSTRY, 2023, 13 (01)
  • [47] Stabilized neural ordinary differential equations for long-time forecasting of dynamical systems
    Linot, Alec J.
    Burby, Joshua W.
    Tang, Qi
    Balaprakash, Prasanna
    Graham, Michael D.
    Maulik, Romit
    JOURNAL OF COMPUTATIONAL PHYSICS, 2023, 474
  • [48] Continuous time and dynamic suicide attempt risk prediction with neural ordinary differential equations
    Sheu, Yi-han
    Simm, Jaak
    Wang, Bo
    Lee, Hyunjoon
    Smoller, Jordan W.
    NPJ DIGITAL MEDICINE, 2025, 8 (01):
  • [49] Neural Ordinary Differential Equations for Hyperspectral Image Classification
    Paoletti, Mercedes E.
    Mario Haut, Juan
    Plaza, Javier
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (03): : 1718 - 1734
  • [50] Local parameter identification with neural ordinary differential equations
    Qiang Yin
    Juntong Cai
    Xue Gong
    Qian Ding
    Applied Mathematics and Mechanics, 2022, 43 : 1887 - 1900