Derivation and analysis of parallel-in-time neural ordinary differential equations

被引:0
|
作者
E. Lorin
机构
[1] Université de Montréal,Centre de Recherches Mathématiques,
[2] Carleton University,School of Mathematics and Statistics
关键词
Residual Neural Network; Neural Ordinary Differential Equations; Parareal method; Parallelism-in-time; 65Y05; 65L20; 68T01; 68T20;
D O I
暂无
中图分类号
学科分类号
摘要
The introduction in 2015 of Residual Neural Networks (RNN) and ResNET allowed for outstanding improvements of the performance of learning algorithms for evolution problems containing a “large” number of layers. Continuous-depth RNN-like models called Neural Ordinary Differential Equations (NODE) were then introduced in 2019. The latter have a constant memory cost, and avoid the a priori specification of the number of hidden layers. In this paper, we derive and analyze a parallel (-in-parameter and time) version of the NODE, which potentially allows for a more efficient implementation than a standard/naive parallelization of NODEs with respect to the parameters only. We expect this approach to be relevant whenever we have access to a very large number of processors, or when we are dealing with high dimensional ODE systems. Moreover, when using implicit ODE solvers, solutions to linear systems with up to cubic complexity are then required for solving nonlinear systems using for instance Newton’s algorithm; as the proposed approach allows to reduce the overall number of time-steps thanks to an iterative increase of the accuracy order of the ODE system solvers, it then reduces the number of linear systems to solve, hence benefiting from a scaling effect.
引用
收藏
页码:1035 / 1059
页数:24
相关论文
共 50 条
  • [31] Solving Ordinary Differential Equations by neural network
    Liu, BA
    Jammes, B
    ESM'99 - MODELLING AND SIMULATION: A TOOL FOR THE NEXT MILLENNIUM, VOL II, 1999, : 437 - 441
  • [32] Modeling Trajectories with Neural Ordinary Differential Equations
    Liang, Yuxuan
    Ouyang, Kun
    Yan, Hanshu
    Wang, Yiwei
    Tong, Zekun
    Zimmermann, Roger
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1498 - 1504
  • [33] Tensorial Time Series Prediction via Tensor Neural Ordinary Differential Equations
    Bai, Mingyuan
    Zhao, Qibin
    Gao, Junbin
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [34] Ordinary fractional differential equations are in fact usual entire ordinary differential equations on time scales
    Damasceno, Berenice C.
    Barbanti, Luciano
    10TH INTERNATIONAL CONFERENCE ON MATHEMATICAL PROBLEMS IN ENGINEERING, AEROSPACE AND SCIENCES (ICNPAA 2014), 2014, 1637 : 279 - 282
  • [36] Parallel evolutionary modeling for nonlinear ordinary differential equations
    Kang, Z.
    Liu, P.
    Kang, L.-S.
    Wuhan University Journal of Natural Sciences, 2001, 6 (03) : 659 - 664
  • [37] PARALLEL METHODS FOR NUMERICAL INTEGRATION OF OF ORDINARY DIFFERENTIAL EQUATIONS
    MIRANKER, WL
    LINIGER, W
    MATHEMATICS OF COMPUTATION, 1967, 21 (99) : 303 - &
  • [38] Convergence analysis for parallel-in-time solution of hyperbolic systems
    De Sterck, Hans
    Friedhoff, Stephanie
    Howse, Alexander J. M.
    MacLachlan, Scott P.
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2020, 27 (01)
  • [39] PARALLEL-IN-TIME MAGNUS INTEGRATORS
    Krull, B. T.
    Minion, M. L.
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2019, 41 (05): : A2999 - A3020
  • [40] TIME AVERAGING FOR ORDINARY DIFFERENTIAL EQUATIONS AND RETARDED FUNCTIONAL DIFFERENTIAL EQUATIONS
    Lakrib, Mustapha
    Sari, Tewfik
    ELECTRONIC JOURNAL OF DIFFERENTIAL EQUATIONS, 2010,